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METHODS OF SCREENING NUCLEIC ACIDS USING MASS 
SPECTROMETRY 
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rights in this invention. 

Technical Field 

This invention relates generally to methods for screening nucleic acids for 
mutations by analyzing fragmented nucleic acids using mass spectrometry. 

INTRODUCTION 

Approximately 4.000 human disorders are attributed to genetic causes. 
Hundreds of genes responsible for various disorders have been mapped, and 
sequence information is being accumulated rapidly. A principal goal of the 
Human Genome Project is to find all genes associated with each disorder. The 
definitive diagnostic test for any specific genetic disease (or predisposition to 
disease) will be the identification of mutations in affected cells (hat result in 
alterations of gene function. Furthermore, response to specific medications may 
depend on die presence of mutations. Developing DNA (or RNA) screening as a 
practical tool for medical diagnostics requires a method that is inexpensive* 
accurate, expeditious, ami robust. 

Genetic mutations can manifest themselves in several forms, such as point 
mutations where a single base is changed to one of the three other bases, 
deletions where one or more bases are removed from a nucleic acid sequence and 
the bases flanking the deleted sequence are directly linked to each other, and 
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insertions where new bases are inserted at a particular point in a nucleic acid 
sequence adding additional length to tlx overall sequence. Large insertions and 
dclctiora, often the result of chromosomal recombination and rearrangement 
events, can lead to partial or complete loss of a gene. Of these form* of mutation, 
in general the most difficult type of mutation to screen for and detect is die point 
mutation because it represents the smallest degree of molecular change. The tetm 
mutation encompasses all the above-listed types of differences from wild type 
nucleic acid sequence. Wild type is a standard or reference nucleotide sequence to 
which variations are compared. As defircd, any variation from wild type is 
considered a mutation including naturally occurring sequence polymorphisms. 

Although a number of genetic defects can be linked to a specific single 
point mutation within a gene, e.g. sickle cell anemia, many are caused by a wide 
spectrum of different mutations throughout the gene. A typical gene that might be 
screened using the methods described here could be anywhere from 1,000 to 
l00 t OGO bases in length, though smaller and larger genes do exist. Of that amount 
of DNA, only a fraction of the base pairs actually encode the protein. These 
discontinuous protein coding regions are called exons and the remainder of the 
gene is referred to as introns. Of these two types of regions, exons often contain 
the most important sequences to be screened. Several complex procedures have 
been developed for scanning genes in order to detect mutations, which are 
applicable to both exons and intxons. 

fi^lectnyphpyesis: Several of the procedures described below use some form of 
gel electrophoresis. Therefore it is worthwhile to briefly consider this separation 
technology before proceeding to the specific methods. In terms of current use, 
most of the methods to scan or s creen genes employ slab or capillary gel 
electrophoresis for the separation and detection step in the assays. Gel 
electrophoresis of nucleic acids primarily provides relative size information based 
on mobility through the gel matrix. If calibration standards are employed, gel 
electrophoresis can be used to measure absolute and relative molecular weights of 
huge biomolecules with some moderate degree of accuracy; even then typically the 
accuracy is only 5% to 10%. Also the molecular weight resolution is limited. In 
cases where two DNA fragments with identical number of base pairs can be 
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separated, using high concentration polyacrylamide gels, it is still not possible to 
identify which band on a gel corresponds to which DNA fragment without 
performing secondary labeling experiments. Gel electrophoresis techniques can 
only determine size and cannot provide any information about changes in base 
composition or sequence without performing more complex sequencing reactions. 
Gel-based techniques, for the most part, are dependent on labeling methods to 
visualize and discriminate between different nucleic acid fragments, 

' BEtft gequeircmg: The principal approach currently used to screen for genetic 
mutations is DNA sequencing. Sequencing reactions can be performed to screen 
the full genetic target base by base. This process, which can pinpoint the exact 
location and nature of muLation, requires labeling DNA, use of polyacrylamide 
gels, and a multiplicity of reactions 10 assess all bases over the length of a gene, 
all of which are slow and labor intensive procedures, [J. Bergh et al. "Complete 
Sequencing of the p53 Gene Provides Prognostic Information in Breast Cancer 
Patients, Particularly tn Relation to Adjuvant Systemic Therapy and 
Radiotherapy/ Nature Medicine L 1029 (1995)] 

For DNA sequencing, nucleic acids comprising different exons or smalt clusters of 
exons are individually amplified, often using polymerase chain reaction <PCR)> 
The amplifications are normally performed separately although some multiplexing 
of reactions is possible. Tie amplified nucleic acids typically range from one 
hundred to several thousand bases in length. Following amplification, die PCR 
products can serve as templates for standard dideoxy-based Sanger sequencing 
reactions. The four different sequencing reactions arc run (or for fluorescence 
detection, one reaction with four different dye terminators) and then analyzed by 
poly^iylamidc gel electrophoresis. Each sequencing run yields about 300 to 600 
bases of sequence which typically must be read with at least a two to three- Fold 
redundancy in order to assure accuracy. Using slab gel. the analysis process 
typical ty takes several hours. 
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SSCP : The single strand conformational polymorphism assay takes advantage of 
structural variation within DNA that results from mutation. The method involves 
folding the single-stninded fonn of a given nucleic acid sequence into a 
thermodynamically directed secondary and tertiary structure. In most cases, 
5 mutated sequences form different structures than the wild type sequence, thus 

permitting separation of mutated and wild type sequences by gel electrophoresis, 
like sequencing, thus a N s$ay i$ complicated by the need to label molecules and run 
polyacrylamidc gels. In a typical case, mutations can be located within a general 
"range of 50 to 200 base pairs, but the exact nature of Ac mutation cannot be 
10 identified. [M. Orita et al. f "Detection of Polymorphisms of Human DNA by Gel 

Electrophoresis as Single-Stranded Conformation Polymorphisms/ Proc, Natl. 
Acad. Sti. USA 86, 2766 (1989)1 

DGGE : Like SSCP, denaturing gradient gel electrophoresis assays also 
15 differentiate based on structural variation, but require the use of gradient gels, 

which are difficult to prepare. The different thermodynamic stability of structures 
formed by the mutant sequence, as opposed to wild type, lead to differences in the 
temperature and/or pH at which the molecule will denature. DGGE mutation 
identification and localization properties arc simitar to those for SSCP though 
20 sensitivity is higher for DGGE because not all mutations cause the structural 

changes that die SSCP method depends upon for detection. [E,S. Abrams, S.R 
Murdangh & L.S. Lerman, "Comprehensive Detection of Single Base Changes in 
Human Genomic DNA Using Denaturing Gradient Gd Electrophoresis and a GC 
Clamp/ Genomics L 463 (1990)] 

25 

EMC : Enzyme mismatch cleavage utilizes one or more enzymes that are capable 
of recognizing interruptions in base pairing within a double-stranded nucleic acid 
molecule, e.g. base-base mismatches, bulges, or internal loops. A given length of 
DNA or RNA is prepared in heterozygous form, with one strand composed of 
30 wild type nucleic acid and the other strand containing a potential mutation. At the 

specific site where the mutation forms a mismatch with the wild type sequence, a 
structural perturbation occurs. An enzyme such as T4 endonuclease VII, RuvC, 
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RNase A, or MutY. can recognize such a structural perturbation and can silo 
specifically cut the dnuhlostranded nucleic acid, creating smaller molecules whose 
sizes indicate the presence and location of the mutation. As with the previously 
discussed methods, this approach as currently used, also requires labeling and gel 
electrophoresis. With thus method, the site of mutation can be localized to within 
a few base pairs but the exact nature of the mutation cannot he deteimincd, [R. 
Youil, B.W. Kemper & R.G.H. Cotton, "Screening for Mutations by Enzyme 
Mismatch Cleavage withT4 Endonuclease VII/ Proc. NatL Acad. Scl USA 22, 
- 87 (1995)} 

CCM: A variation of EMC is to replace the enzymatic cleavage step with chemical 
cleavage. Chemical cleavage mismatch analysis involves the use of reagents such 
as osmium tetroxide to react with mismatched thymine residues or hydroxy iamine 
to react with mismatched cytosine residues. Cleavage of the modified mismatched 
residues occurs when the modified bases are subsequently treated with piperidine 
or another oxidizing agent. The effectiveness of the method is similar to EMC. 
[I .A. SaJecba & R.G.H. Cotton, "Chemical Cleavage of Mismatch to Detect 
Mutations, " Methods in Jfnzymoiogy 217, 286 (J 993)] 

Hybridizati on Arrays : Several approaches to screening for mutations involve the 
probing of a target nucleic acid by an array of oligonucleotides thai can 
differentiate between normal wild type nucleic acids and mutant nucleic acids. 
These arrays involve the performance of hundreds or thousands of hybridization 
reactions in parallel with different site-directed oligonucleotides and requires 
sophisticated and costly probe arrays. Hybridization arrays can identify the 
location and type of mutation in maty, but not all cases. For example, 
semihomologous sequential insertions or targets with repeating sequences and/or 
repeating sequential motifs cannot be analyzed by hybridization. [A.C. Pease ct 
al., "Light-Generated Oligonucleotide Arrays for Rapid DNA Sequence Analysis, 0 
Pros. Nail. Acad. Sci. USA £L, 5022 (1994)] 
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Simnle screens : For mutations localized within a given gene, such as the cystic 
fibrosis AF508 deletion, H is also possible to perform a single PCR or ligasc chain 
reaction (LCR) assay or simple hybridization assays tailored to these specific sites. 
PCR and LCR results are presently determined by the use of labeled molecules, 
where radioactive emissions, fluorescence, chemiluminescence or color changes 
are detected directly. These simple screen* amount to a yes/no answer and do noi 
directly identify the nature of the mutation, only whether or not a reaction took 
place. [P. Fang et aL, Simultaneous Analysis of Mutant and Normal Alleles tor 
Multiple Cystic Fibrosis Mutations by the ligasc Chain Reaction/ Human 
Mutation 6, 144 (1995)] 

AJI of the methods in use today capable of screening broadly for genetic 
mutations suffer from technical complication and are labor and time intensive. 
There is a need for new methods that can provide cost effective and expeditious 
means for screening genetic material in an effort to reduce medical expenses. The 
inventions described here address these issues by developing novel, tailor-made 
processes that focus on the use of mass spectrometry as a genetic analysis tool. 
Mass spectrometry requires minute samples, provides extremely detailed 
information about the molecules being analyzed including high mass accuracy, and 
is easily automated. 

The laie 1980's saw the rise of two new mass spectromctric techniques fur 
successfully measuring the masses of intact very large biomolcculcs, namely, 
matrix-assisted laser desotption'ionrzation (MALDI) time-of-flight mass 
spectrometry (TOF MS) [K. Tanaka et al., 'Protein and Polymer Analyses up to 
m/z 100,000 by Laser Ionization Time-of-flight Mass Spectrometry/ Rapid 
Commun. Mass Spectrom. 2. 151-153 (1988); B. Spengler et al., "Laser Mass 
Analysis hi Biology/ Bcr, Bunsenges. Phys. Chem. SI. 396-402 (1989)] and 
electrospray ionization (ESI) combined with a variety of mass analyzers [J. B. 
Perm et aL, Science 246, 64-71 (1989)1- Both of these two methods are suitable 
for genetic screening tests. The MALDI mass spectrometry technique can also be 
used with methods other than time-of-flight, for example, magnetic sector, 
Fourier-Transform, ion cyclotron resonance, quadropolc. and quadropole trap. 
One of the advances in MAI JOT analysis of polynucleotides was the discovery of 
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3-hydroxypicolinic acid as an ideal matrix for mixed-base oligonucleotides. Wu, 
et al., Rapid Commons in Mass Spectrometry, 7:142-146 (1993). 

MALDI-TOF MS involves laser pulses focused on a small sample plate 
comprising analyte molecules (nucleic acids) embedded in either a solid or liquid 
5 matrix comprising a small, highly absorbing compound. The laser pulses transfer 

energy to the matrix causing a microscopic ablation and concomitant ionization of 
the analytc molecules, producing a gaseous plume of intaci, charged nucleic acids 
in single-stranded form. If double-stranded nucleic acids are analyzed, the 
MALDI-TOF MS typically results in mostly denatured single-strand detection. 

10 The ions generated by the laser pulses are accelerated to a fixed kinetic energy by 

a strong electric field and then pass through an electric field-free region in vacuum 
in which the ions travel with a velocity cunesponding to their respective mass-to- 
charge ratios (m/2). The smaller m/z ions will travel through the vacuum region 
faster than the larger m/z ions thereby causing a separation. At the cod of the 

15 electric field-free region, the ions collide with a detector that generates a signal as 

each set of ions of a particular mass-to-charge ratio strikes the detector. Usually 
for a given assay, 10 to 100 mass spectra resulting from individual laser pulses are 
summed together to make a single composite mass spectrum with an improved 
signaNo-noise ratio. 

20 The mass of an ion (such as a charged nucleic acid) is measured by using 

its velocity to determine the mass-to-charge ratio by rime-of-flight analysis. In 
other words, the mass of the molecule directly correlates with the time it takes to 
travel from flic sample plate to the detector. The entire process takes only 
microseconds. In an automated apparatus, tens to hundred* of samples can be 

25 analyzed per minute. In addition to speed, MALDI-TOF MS has one of the 

largest mass ranges for mass spectrometry devices. The current mass range for 
MALJ1-TOF MS is from 1 to 1,000,000 Daltotis (Da) (measured recently for a 
protein). [FL W, Nelson et al., "Detection of Human IgM at m/z -1 MDa/ 
Rapid Common. Mass Spectrum. 2, 625 (1995)] 

30 The performance of a mass spectrometer is measured by its sensitivity, 

mass resolution and mass accuracy. Sensitivity iw measured by the amount of 
material needed; it is generally desirable and possible with mas* spectrometry to 
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work with sample amounts in the fetntomole and low picomole range. Mass 
resolution, m/Am, is the measure of an instrument's ability to produce separate 
signals from ions of similar mass. Mass resolution is defined as the mass, ra> of a 
ion signal divided by the full width of the signal. Am* usually measured between 
5 points of half-maximum intensity. Mass accuracy is the measure of error in 

designating a mass to an ion signal. The mass accuracy is defined as the ratio of 
the mass assignment error divided by the mass of the ion and can be represented 
as a percentage. 

To be able to detect any point mutation directly by M ALDI-TOF mass 

10 spectrometry, one would need to resolve and accurately measure the masses of 

nucleic acids in which a single base change has occurred (in comparison to the 
wild type nucleic acid). A single base change can be a mass difference of as iittlc 
as 9 Da. This value represents the difference between the two bases with the 
closest mass values, A and T (A - 2'-deoxy adenosine -5 '-phosphate = 313.19 Da; 

15 T = I'-deoxythymidine-S'-phosphate = 304,20 Da; G = 2 4 -deoxyguanosiiie-5 t - 

phosphate « 329.21 Da; and C = 2 , -deoxycytidine-5 > -phosphate = 2S9.19 Da). 
If during the mutation process, a single A changes to T or a single T to A, the 
mutant nucleic acid containing the base trausversion will either decrease or 
increase by 9 in total mass as compared to the wild type nucleic acid. For mass 

20 spectrometry lo directly detect these transversions, it must therefore be able to 

detect a minimum mass change. Am, of approximately 9 Da. 

For example, in order to fully resolve (which may not he necessary) a 
point-mutated (A to T or T to A) heteroiygote 50-base single-stranded DNA 
fragment having a mass, m, of - 15,000 Da from its corresponding wild type 

25 nucleic acid, the required mass resolution Is tn/Am = 15,000/9 1 ,700, 

However* the mass accuracy needs to be significantly better than 9 Da to increase 
quality assurance and to prevent ambiguities where the measured mass value is 
near the half-way point between the two theoretical masses. For an analyte of 
15,000 Da, in practice the mass accuracy needs to be Am - ±3 Da = 6 Da, In 

30 this case, the absolute mass accuracy required is (6/15,000)* 100 « 0.04%. Often 

a distinguishing level of mass accuracy relative to another known peak in Ate 
spectrum Ls sufficient to resolve ambiguities. For example, if there is a known 



WO £7/33000 PCT/US97/M439 

9. 

mass peak IO0O Da from die mass peak in question, the relative position of the 
unknown to the known peak may be known with greater accuracy than that 
provided by an absolute, previous calibration of the mass spectrometer 

In order for mass spectrometry to he a useful tool for screening for 
mutations in nucleic acids, several basic requirements need to be met. First, any 
nucleic acids to be analyzed must be purified to the extent that minimize.? salt ions 
and other molecular contaminants that reduce the intensity and quality of the mass 
spectrometric signal to a point where either the signal is undetectable or 
unreliable, or the mass accuracy and/or resolution is below the value necessary to 
detect single base change mutations. Second, the size of the nucleic acids to be 
analyzed must be within the range of the mass spectrometry- where there is the 
necessary mass resolution and accuracy. Mass accuracy and resolution do 
significantly degrade as the mass of the analyte increases; currently this is 
especially significant above approximately 30,000 Da for oligonucleotides ( - 100 
bases) Third, because all molecules within a sample are visualized during mass 
spectrometric analysis (i.e. it is not possible to selectively label and visualize 
certain molecules and not others as one can with gel electrophoresis methods) it is 
necessary to partition nucleic acid samples prior to analysis in order to remove 
unwanted nucleic acid products from the spectrum. Fourth, the mass 
spectrometric methods for generalized nucleic acid screening must be efficient and 
cost effective in order to screen a large number of nucleic acid bases in as few 
steps as possible. 

The methods for detecting nucleic acid mutations known in the art do not 
satisfy these four requirements. For example, prior art methods for mass 
spectrometric analysis of DNA fragments have focusscd on double-stranded DNA 
fragments which result in complicated mass spectra, making it difficult to resolve 
mass differences between two complementary strands. See, e.g., Tang ct al. , 
Rapid Comm a in Mass Spectrometry, 8:183 186 (1994). 

Urns, there is a need for cost and time effective methods of delecting 
genetic mutations using mass spectrometry, preferably MALDi or ES, without 
having to sequence the genetic material and with rmus accuracy of a few parts in 
10,000 or better. 
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SUMMARY OF T1IE INVENTION 

The present invention provides methods of and kits for detecting mutations 
in a laQset nucleic acid comprising nonrandomly fragmenting said target nudeic 
acid to form a set of nonrandom length fragments (NLFs), determining masses of 
members of said set of NLFs using mass spectrometry* wherein said determining 
does not involve sequencing of said target nucleic add. 

In a preferred embodiment, the method of detecting mutations comprises 
obtaining a set of nonrandom length fragments in single-stranded form. The 
masses of the members of the set of NLFs can be compared with the known or 
predicted masses of a set of NIJs derived from a wild type target nucleic acid that 
is the wild type version of the target nucleic acid that is being screened for 
mutations. The members of the set of single-stranded NLFs can optionally have 
one or more nucleotides replaced with mass-modified nucleotides, including mass- 
modified nucleotide analogs. Another optional aspect of the invention is the 
inclusion of internal calibrate or internal sdf-calibrants in the set of nonrandom 
length fragments to he analyzed by mass spectrometry to provide improved mass 
accuracy. 

The present invention includes a number of nonrandom fragmentation 
techniques for noorandomly fragmenting a target nucleic acid. 

In one embodiment, the nonrandom fragmentation technique comprises 
hybridizing a single-stranded target nucleic acid to one or more sets of 
fragmenting probes to form hybrid target nucleic acid /fragmenting probe 
complexes comprising at least one double-Rtnmded region and at least one single- 
stranded region, nonrandomly fragmenting raid target nucleic acid by cleaving said 
hybrid target nucleic acid/fragmenting probe complexes at every single-stranded 
region with at least one single* strand-specific cleaving reagent to form a set of 
NLFs, The set of fragmenting probes can leave single-stranded regions between 
double -stranded regions formed by hybridization of said set of fragmenting probes 
to said target nucleic acid. A single-stranded region comprises a portion of a 
polynucleotide sequence as small as a single phosphodicstcr bridge, i.e. the 
phosphodlester bond across from a nick, to 450 nucleotides in length. 
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The fragmenting probes are oligonucleotides that are complementary to a 
nucleotide sequence of the target nucleic acid. A set of fragmenting probes can be 
created such that the nucleotide sequences of the members of the set of 
fragmenting probes represents the entire complement to the nucleotide sequence of 
the target nucleic acid. For example, a set of fragmenting probes can provide 
complete complementary sequence to the target nucleic acid. Alternatively, a set 
of fragmenting probes, when hybridized to the target nucleic acid, can leave 
single -stranded regions. Also, one or more sets of fragmenting probes can be 
wed such that the members of one set of fragmenting probes contain nucleotide 
sequences that overlap with nucleotide sequences of members of a second set of 
fragmenting probes. In yet another aspect, there are provided two sets of 
fragmenting probes, where members of the second set of fragmenting probes 
comprise at least one single-stranded nucleotide sequence complementary to 
regions of said target nucleic acid that are not complementary to any nucleotide 
sequences in any members of said first set of fragmenting probes. 

Once the set(s) of fragmenting probes are hybridized to the target nucleic 
acid, the singJe-stranded regions are cleaved using single-strand -specific cleaving 
reagents, including enzymatic reagents as well as chemical reagents. Single-strand 
specific chemical cleaving reagents include hydroxy lamine. hydrogen peroxide, 
osmium tetroxide, and potassium permanganate. 

Yet another nomandom fragmentation technique comprises providing a 
single-stranded target nucleic acid, hybridizing the single-stranded target nucleic 
acid to one or more restriction sice probes to form hybridized target nucleic acids 
comprising double-stranded regions where said restriction site probes have 
hybridized to said single-stranded target nucleic acid and at least one single- 
stranded region* nonrandomly fragmenting the hybridized target nucleic acids 
using one or more restriction endonucleases that cleave at restriction sites within 
the double-stranded regions. Another variation on this technique involves use of 
universal restriction probes comprising two regions, the First region being single- 
stranded and complementary to a specific site within the target nucleic acid, and 
the second region being double-stranded and c ntainirtg the restriction recognition 
site for a particular class 115 restriction endonuckase. Class IIS restriction 
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endonucleases cleave double-stranded DNA at a specific distance from their 
recognition site sequence. 

Another technique fuT nonrandom fragmentation comprises fragmenting the 
target nucleic acid with one or more restriction endonucleases to form a set of 
NLFs. This and the other forms of nonrandom fragmentation can be combined 
with direct and indirect capture to a solid support to isolate single-stranded NLFs 
for mass spectrometric analysis. 

Another nonrandom fragmentation technique comprises providing 
conditions permitting folding of said single-stranded target nucleic acid to form a 
three-dimensional structure having intramolecular secondary and tertiary 
interactions, and nonrandomly fragmenting said folded target nucleic acid with at 
least one structure-specific endonuclease to form a set of single-stranded NLFs. A 
set of nonrandom length fragments can comprise a nested set of NLFs, wherein 
each member of the set has a 5' end of the target nucleic acid. The structure- 
specific endonucleases useful for nonrandom fragmentation comprise any nucleases 
that cleave at structural transitions within nucleic acids, including; Holliday 
junctions, single-strand to double-strand transitions, or at die ends of hairpin 
structures. 

Another nonrandom fragmentation method comprises mutation-specific 
cleavage by hybridizing a target nucleic acid to a set of one or more wfld type 
probes and specifically cleaving at any regions of nucleotide mismatch or base 
mismatch that form between the target nucleic acid and a wild type probe. The 
mutation-specific cleavage can be accomplished using a mutation-specific cleaving 
reagent comprising structure-specific endonuclease or chemical reagents. 

The nonrandom fragmentation methods described herein can be combined 
to form different sets or subsets of nonrandom length fragments. For example, 
the base mismatch nonrandom fragmentation method using wild type probes can be 
used in concert with a set of nonrandom length fragments that have already been 
creating using any one of the other nonrandom fragmentation methods. These 
nonrandom fragmentation methods can also be combined with isolation methods 
designed to isolate specific sets of single-stranded nonrandom length fragments, 
for example, only those NLFs derived from the + strand of the target nucleic 
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acid. The isolation methods include direel capture of the set of NLFs to a solid 
support or indirect capture of a $cr of NLFs to a solid support via a capture probe 
capable of binding to a solid support via covalent or noncovalent binding. The 
fragmenting, wild type, restriction site, and universal restriction probes described 
herein can be also be used as capture probes for isolating a particular sec of NLFs. 

The isolation methods also comprise the use of a solution of volatile sails lu 
wash away undesired contaminants from the set of NLFs intended for mass 
determination in the mass spectrometer. The volatile salts are useful for removing 
* background noise and can be easily removed by evaporation of the volatile salts 
prior to mass spactromecric analysis. Volatile salt solutions can be used in a 
variety of different methods to prepare organic molecules such as nucleic acids and 
polypeptides for mass spectrometry analysis. Thus, a method is described herein 
of decreasing background noise, wherein the method comprises obtaining a sampie 
co be analyzed by a mass spectrometer, washing the sample with a solution of 
volatile salts, and evaporating the solution of volatile salts from the sample. 

The fragmentation and isolation methods separately or together can also be 
combined with the use of internal self-caiibrants to improve the mass accuracy of 
the mass spectrometry analysis. 

The above methods, separately or in combination, can also be combined 
with the use of mass-modified nucleotides and mass-modified nucleotide analogs 
incorporated in the target nucleic acid or a set ui NLFs to improve mass resolution 
between mass peaks. 

Kits for detecting mutations in one or more target nucleic acids in a sample 
are also provided. In preferred embodiments, such kits comprise one or more 
single-stranded target nucleic acids, one or more sets of oligonucleotide probes, 
wherein each of said probes is complementary to a portion of said single-stranded 
target nucleic acids, and various cleaving reagents, including single-strand specific 
cleaving reagents, restriction cndonucleascs (both Class II and Class IIS), and 
mutation-specific cleaving reagents. The oligonucleotide probes include 
fragmenting probes, restriction site probes, and wild type probes. Such kits can 
also contain a matri*, preferably 3-hydroxypicoUnic acid. The kits may also 
contain volatile salt buffers, and buffers providing conditions suitable for the 
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enzymatic or chemical reactions described above for nonrandomly fragmenting 
target nucleic acid* and isolating nonrandom length fragments in preparation for 
maw spectrometry analysis. Additionally, the kits may contain solid supports for 
purposes of isolating nonrandom length fragments. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 A and IB display examples of resolved nucleic acid fragments 
(DNA) in the 20,000 lo 30,000 Da range using MALDI-TOF mass spectrometiy. 
Both FIG. 1 A and IB arc positive ion mass spectra obtained from 200 f moles of 
10 DNA in 3-IIPA (3-hydroxypicolinic acid). Each spectrum is a sum of 100 laser 

pulses at 266 nm. FIG. IA: a single-stranded 72-mer which also shows a 71 - 
mer. The FWIIM resolution is 240, clearly resolving matrix adducts (labelled M). 
FIG. IB: 88-mer parent peak has a resolution of 330. 

FIG, 2 is a diagram illustrating the basic steps for mass spectrometry 
15 analysis of a nonrandomly-fragmented, double-stranded target nucleic acid, 

FIG. 3 is a diagram illustrating the expected mass spectrum for a 
nonrandomly -fragmented double-stranded target nucleic acid that is a heterozygous 
mix of wild type and mutant nucleic acid where the mutation is an A to T 
transversion. 

20 FIG. 4A and 4B illustrate the effect on mass resolution of a mass- 

substituted base where a T has been replaced by heptynyldeoxyuridine during 
amplification of the mutant region. FIG. 4A depicts a mass spectra of a 
heterozygous mix of wild type and mutant where A has mutated to T. Spectral 
peaks axe separated by 9 mass units. FIG. 4B depicts a mass spectra of a 

25 heterozygous mix of wild type and mutant where A has mutated to T. T has been 

replaced by heptyny ldeoxyurid ine during amplification of the rautani region. 
Spectral peaks are now separated by 65 mass units. 

FIG. S is a diagram illustrating the affect of analyzing only positive strand 
fragments from a heterozygous sample In reducing the number of total fragments 

30 aud simplifying the mass spectrum, 

FIG. 6 is a diagram illustrating the use of restriction site probes to produce 
nonrandom fragments from single-stranded target nucleic acid. Note that in the 
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step of purifying nonrandoni length fragments, the small cleaved probes will likely 
be removed during purification 

FIG. 7A and R illustrate the use of fragmenting probes in conjunction with 
singk-strand-speciiic endonucJease to produce nonrandom fragments from single- 
stranded target nucleic acid. 

FIG, 8 Ls a diagram illustrating the use of fragmenting probes in 
conjunction with single-strand-spcsific, base-specific chemical cleavage to produce 
nonrandom fragments from single-stranded target nucleic acid, 

FIG. 9 A and B illustrate the use of fragmenting probes to produce 
nonrandom fragments from heterozygous, single-stranded target nucleic add in 
combination with the use a mismatch-specific cleaving reagent to further fragment 
the target nucleic acid at ihe site of a mutation. 

FIG, 10 is a diagram illustrating a method of detecting a mutation using 
mass Rpecirometric analysis of nonrandomly fragmented mutant and wild-type 
double-stranded nucleic acids that have been denatured and reannealed and then 
cleaved at any mismatch regions. 

FIG. II is a diagram illustrating die effect of analyzing only positive 
strand fragments from a heterozygous sample in reducing the number of cowl 
fragments and simplifying the mass spectrum. In this case the positive strand has 
been nonrandomly fragmented using both restriction endonuclease treatment and 
mismatch-specific cleavage. 

FIG, 12 is a diagram illustrating the use of structures-specific 
endoimcleases to nonrandomly fragment a folded, single-stranded target nucleic 
acid. 

FIC 13A and B illustrate (be use of a full length capture probe to isolate 
and purify a set of single-stranded nonrandom length fragments. Shown in FIG. 
13B as an option is a second step involving cleavage at mutation-specific 
mismatch, litis mismatch cleavage is particularly useful for cases where mutant 
DNA is hybridized to wild type. 

FIG. 14 is a mass spectrum of a set of nonrandom length fragments from a 
target nucleic acid containing a mutation, wherein the target nucleic acid is 
nonrandomly fragmented with bydroxylaxnine followed by piperidioc, resulting in 
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mutation-specific cleavage at a mismatch. This mass spectrum illustrates the 
presence of a nonrandom length fragment of 75 bases in size, lhai results from 
mutation-specific cleavage. 

FIG. 15 Ls a mass spectrum illustrating hydroxylamine fragmentation of a 
wild type control of the mutation-containing target nucleic acid of Fig. 14. This 
mass spectrum lacks a fragment of 75 bases in size due to the lack of a mutation 
in the wild type target nucleic acid. 

FIG. 16 b a mass spectrum of a mutation-containing target nucleic acid 
dial is specifically cleaved with potassium permanganate at the site of a base 
mismatch, 

FIG. 17 is a mast; spectrum of a set of 5 single-stranded nonrandom length 
fragments from an MNL 1 digest of a wild type target nucleic acid of 184 

nucleotides in length. 

FIG, 18 is a magnified mass spectrum of two fragments, both 26 bases in 
length, identical in nucleotide sequence except for a single G to A point mutation, 
Illustrating clear resolution of the two mass peaks. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

The present invention, directed to methods of screening target nucleic acids 
to detect mutations using mass spectrometry techniques to analyze post- 
amplification nucleic acids, provides the advantages of technical ease, speed, and 
high sensitivity (minute samples are required). The methods described herein 
yield a minimal set of products widi improved mass resolution and accuracy and 
detailed information about the nature and location of the 
mutation in the target nucleic acid. 

The present invention involves obtaining from a target nucleic acid, using a 
variety of nonrandom fragmentation techniques, a set of nonrandom length 
fragments (NLFs) and determining the mass of the members of the set of NLFs. 

The target nucleic acid can he single-stranded or double-stranded DNA, 
RNA or hybrids thereof, from any source, preferably from a human source, 
although any source which one is interested in screening for mutations can be used 
in the methods described herein. When the target nucleic acid is RNA, the RNA 
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strand is the + strand, If desired, the target nucleic acid can be "an RNA/DNA 
hybrid, wherein cither strand can be designated the + strand and the other, the - 
strand. The target nucleic acid is generally a nucleic acid which must be screened 
to determine whether it contains a mutation. The corresponding target nucleic acid 
derived from a wild type source is referred to as a wild type target nucleic acid. 
The target nucleic acids can be obtained from a source sampk containing nucleic 
acids and can be produced from the nucleic acid by PCR amplification or other 
amplification technique. The target nucleic acids are typicafly too large to analyze 
* directly because current mass spectrometry methods do not have the mass 
accuracy and resolution necessary to identify a single base change in molecules 
larger than 100 base pairs. Accordingly, the target nucleic acids mast be 
fragmented, 

Nonrandom length fragments are nucleic acids derived by nonrandom 
fragmentation of a target nucleic acid, and can comprise regions or nucleotide 
sequences that are single-stranded or double-stranded. Due to the simpler mass 
spectrum thai results from mass analysis of single-stranded nonrandom length 
fragments, it is preferred to determine the masses of sets of single-stranded 
nonrandom length fragments. The nuniandom length fragments can also contain 
mass-modified nucleotides, which can enhance ease of analysis, especially when a 
point mutation has resulted in a very small mass change (on the order of 9 Da) in 
a nonrandom length fragment as compared to the corresponding wild type 
nonrandom length fragment. The methods described herein use mass spectrometry 
to determine the masses of the set or sets of nonrandom length fragments to detect 
mutations in a target nucleic acid. 

The nonrandom fragmentation techniques of the invention are any methods 
of fragmenting nucleic adds dial provide a defined set of nonrandom length 
fragments, where that set of nonrandom length fragments may be reproducfrly 
obtained by using the same nonrandom fragmentation method on the same target 
nucleic acid or its wild type version. The methods used for nonrandom 
fragmentation arc designed to optimize the ease of analyzing the resulting mass 
spectral data by obtaining a range of fragment sizes that avoids significant overlap 
of mass peaks. The nonrandom fragmentation techniques of the invention include 
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digestion with restriction endonuc leases, structure-specific cndonucleases, and 
specific chemical cleavage. The enzymatic nonrandom fragmentation techniques 
include partial digestion with restriction cndonucleases or structure-specific 
endonucJeases. Partial cleavage occurs when not every possible cleavage site is 
cleaved by the cleaving reagents used, whether enzymatic or chemical. 

Fragmenting probes used in the invention are nucleic acids comprising a 
single-stranded nucleotide sequence or regiuo (hat is complementary to a 
nucleotide sequence of a target nucleic acid. When fragmenting probes arc also 
used as capture probes (i.e. to bind the fragmenting probe and any complementary 
nucleic acids hybridized thereto to a solid support), the fragmenting probes 
comprise a first binding moiety that is capable of binding to a second binding 
moiety attached to a solid support. Upon hybridization of a set of fragmenting 
probes and a target nucleic acid, the hybrid can be nourandomly fragmented using 
one or more cleaving reagents that specifically cleave single-stranded regions. 

Restriction site probes are oligonucleotides that when hybridized to single- 
stranded target nucleic acid at specific complementary sequences form complete 
double-stranded restriction endonuclease recognition sites cleavable using the 
restriction endonuclease capable of cleaving at or near the recognition sites 
formed. 

Universal restriction probes comprise two regions, the first region being 
single-stranded and complementary to a specific sequence within the target nucleic 
acid, and the second region being double-stranded and containing the restriction 
recognition site for a particular class IIS restriction endonuclease. 

Capture probes used in the methods described herein comprise fragmenting 
probes, restriction site probes, universal restriction probes, and aery nucleic acids 
that are bound to a solid support to isolate sets or subsets of nucleic acids or 
NLFs, Capture probes can comprise a cleavable linkage or cleavable moiety that 
can be selectively cleaved to release nucleic acids from a solid support prior to 
mass specaometric analysis. 

Wild typo probes are nucleic acids derived from a wild type nucleic acid 
sequence comprising at least one nucleotide sequence complementary to a 
nucleotide sequence of a target nucleic acid or a member of a set of NLFs. Wild 
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type prober can be restriction site probes, fragmenting probes, or capture probes 
comprising a wild type nucleotide sequence that when hybridized to a 
complementary mutation-containing region of a target nucleic acid results in a base 
mismatch bulge or loop structure. Wild type refers to a standard or reference 
nucleotide sequence to which variations are compared. As defined, any variation 
from wild type is considered a mutatiun, including naturally occurring sequence 
polymorphisms. 

The term complementary refers to the formation of sufficient hydrogen 
bonding between two nucleic acids to stabilize a double-stranded nucleotide 
sequence formed by hybridization of the two nucleic acids. 

A single-stranded region comprises a portion of a nucleotide sequence thai 
is capable of being selectively cleaved by single-strand -specific cleaving reagents 
or structure-specific endomicleases, wherein the portion of a nucleotide sequence 
can range in size from a single phosphodiester bridge, i.e. the phosphod tester bond 
across from a nick, to a nucleotide sequence ranging from one to 450 nucleotides 
in length which are not hybridized to a complementary nucleotide sequence or 
region. 

The types of mass spectrometry used in the invention include ESI or 
MALD1, wherein the MALDT method may optionally include time-of- flight. The 
significant multiple charging of molecules in ESI and the fact that complex mixture 
analysis is generally required mean that the ESI mas* spectra will consist of a 
great many spectral peaks, possibly overlapping and causing confusion. Because 
the MALDI MS approach produces mass spectra with many fewer major peaks r 
this method is preferred. 

The methods described herein do not require sequencing of die target 
nucleic acid (using the sequencing methods that require four different base-specific 
chain termination reactions to determine (be complete nucleotide sequence of a 
nucleic acid) In order to determine the nature and presence of a mutation within 
the target nucleic acid. 

For atl initial mutation screen, a useful range of fragment sizes that will 
allow detection of a point mutation is around 10 to 100 bases. This size range is 
where mass spectrometry presently has the necessary level of maw resolution and 
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accuracy. Thus, the fragmentation methods used in this invention are designed to 
produce from the target nucleic acid, a set of nonrandom length fragments ranging 
up to 100 bases in size. For purposes of this invention, fragmentation methods 
that produce a set of random length fragments arc not desirable due to the limited 
5 reproducibility of such fragments, the limited information available from mass 

spectrometry analysis of such fragments, and the likelihood of spectral overlap 
from randomly generated fragments. For example, nonrandom fragmentation 
permits determination of the mass, base composition, and location of the set of 
"NLFs relative to the target nucleic acid, whereas random fragrneniatiun methods 
10 do not 

Existing mass spectrometry instrumentation in the case of MALDI-TOF 
MS optimally has a mass accuracy of about 1 pan in 10,000 (0.01 %), four times 
what is necessary for detecting a single base change in a 5G-basc long single- 
stranded DN A fragment. Utilization of mass-modified nucleotides (to be described 

15 later) and nearby masses as internal cati brants, provides optimal resolution and 

mass accuracy of larger nucleic acids, and can extend the usable mutation 
detection range up to 100 bases, if nut higher. Continued advances in mass 
spectrometry instrumentation will also push this range higher. 

Examples of the resolving capabilities of MALDJ-TOF MS are displayed in 

20 FIG. 1A and IB. FIG. 1 shows the positive ion TOF mass bpectra obtained from 

200 fmoles of DNA in the matrix 3-HPA- FIG. 1A (top figure) shows two singles 
stranded PGR products of lengths 71 and 72 (mass difference = 305 Da = 
Adenosine) as well as the 72mer and 72mer + a single matrix adduct (M) (mass 
difference » 139 Da) to be well resolved (FWHM resolution = 240). FIG. IB 

25 (bottom figure) shows an 88 base length single stranded product having a 

resolution of 330. Both spectra display high enough accuracy and resolution to 
detect a point mutation if one were present. 

These unique properties of mass spectrometry, MALDI-TOF MS in 
particular, to separate nucleic acid fragments and identify dieir mass exactly and 

30 the methods taught herein provide novel methods for the screening of target 

nucleic acids and identification of changes in base composition that might result 
from genetic mutation. 
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Improving Mass accuracy By Internal Calibration and Internal 
Self-Calibration 

Mass spectrometers arc typically calibrated using analytes of known mass. 
5 A mass spectrometer can then analyze an analyte of unknown mass with an 

associated mass accuracy and precision. However, the calibration, and associated 
mass accuracy and precision, for a given mass spectrometry system (including 
MALDI-TOF MS) can be significantly improved if analytes of known mass are 
contained within die sample containing the analyte(s) of unknown mass(es). The 

10 * inclusion of these known mass analytes within the sample is referred to as use of 
internal calibrams. External calibrates, Le. analytes of known mass that are not 
mixed in with the set of nonrandom length fragments of unknown mass and 
simultaneously analyzed in a mass spectrometer, arc analyzed separately. External 
calibrams can also be used to improve mass accuracy, but because they arc not 

IS analyzed simultaneously with the set of fragments of unknown mass, they will not 

increase mass accuracy as much as internal calibrams do. Another disadvantage of 
using external calibrams is (hat it requires an extra sample to be analyzed by the 
mass spectrometer. For MALDI-TOF MS, generally only two calibrant molecules 
are needed for complete calibration, although sometimes three or more calibrams 

20 are used. All of the embodiments or the invention described herein can be 

performed whh the use of internal calibrant* to provide improved mass accuracy. 

Using the methods described herein, one can obtain a mass spectrum with 
numerous mass peaks corresponding to the set of nonrandom length fragments of 
the gene or target nucleic acid under study. If no mutation is present in the target 

25 nucleic acid, all of the mass peaks corresponding to the nonrandom length 

fragments will be at mass-to-charge ratios associated with the set of NLFs from 
the wfld type target nucleic acid. However* if the target nucleic acid contains a 
mutation, usually no more than one or two of the mass peaks will be shifted in 
mass, leaving the majority of mass peaks at unaltered locations. In a preferred 

30 embodiment of the invention, a self-calibration algorithm uses these unmutated or 

nonpolymorphic NLFs for internal calibration to optimize the mass accuracy for 
analysis of the NLFs containing a mutation, thus requiting no added calibrates), 
simplifying the calibration, and avoiding potential spectral overlaps. In a given 
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sample, however, it will not be known a priori which mass peaks, if any. arc 
altered or shifted from their expected masses for the wild type NLFs. 

The self-calibration algorithm begins by dividing up the observed mass 
peaks into subsets, each subset consisting of all but one or two of the observed 
mass peaks. Each data subset has a different one or two mass peaks deleted from 
consideration. For each subset, the algorithm divides the subset further into a first 
group of two or three masses which are then used to generate a new set of 
calibration constants, and a second group which will serve as an internal 
consistency check on those new constants. The internal consistency check begins 
by calculating the mass difference between the m/z values calculated for the 
second group of mass peaks and the values corresponding to reasonable choices 
for the associated wild-type NLFs. The internal consistency check can thus take 
Ihe form of a chi-square minimization where the key parameter is this mass 
difference. The algorithm finds which data subset has the lowest sum of the 
squares of these mass differences resulting in a choice of optimized calibration 
constants associated with group one of this data subset. 

After new self-optimized calibration constants are obtained, the mass-to- 
charge ratios are determined for the mass peaks omitted from the data subset; 
these are the non random length fragments suspected to contain a mutation. The 
differences from the observed mass peaks for the wild type NLFs are then used to 
determine whether a mutation has occurred, and if so, what the nature of this 
mutation fas (e.g. the exact type of deletion, insertion, or point mutation). This 
self-calibration procedure should yield a mass accuracy of approximately 1 part In 
10,000, 

Fragmentation of Target Nucixtc acids 

Fragmentation of a target nucleic acid is important for several reasons. 
First, fragmentation allows direct analysis of large segments of a gene or other 
target nucleic acid using a single PCR amplification, eliminating the need to 
multiplex or run separately many smaller-segment PCR reactions. 

Second, sequencing of thousands of bases of a gene or other target nucleic 
acid, by mass spectrometry or otherwise, is a complex and expensive process. 
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With currcm capabilities in MALDI and ESI, it is impractical to sequence nucleic 
acids greater than 50-100 bases in length. Consequently, in order to rapidly 
screen large genetic regions or target nucleic acids using mass spectrometry 
nucleic acid sequencing, an impractical and cumbersome number of independent 
sequencing reactions arc necessary to cover the entire genetic region of interest. 
Therefore, for screening large genetic regions or targci nucleic acids for a wide 
range of potential mutations using mass spectrometry , fragmentation of amplified 
target nucleic acids ranging from 100 to 1000 base pairs (bp) facilitates faster 
* screening of larger target nucleic acids or genetic regions of interest. 

Sequencing can identify the exact location and nature of a genetic mutation 
in a target nucleic acid, hut requires the use of many primers in many separate 
reactions. Mutations, especially for heterozygous samples analyzed using 
fluorescence-based systems, arc often difficult to identify with confidence. Using 
the fragmentation methods described herein, a heterozygous sample would yield 
two distinct mass spectral peaks, correlating to the different masses of the mutant 
and wild type nucleic acids. Accordingly, the methods described herein can be 
used to detect a mutation in a target nucleic acid unambiguously. 

Third, mass spectrometry analysis of smaller nucleic acid fragments, 
ranging in size from 2 to 300 bases, more preferably from 10 to 100 bases in 
length, is desirable because the smaller nucleic acid fragments result in: 

(a) more specific localization of any mutations than for larger sized nucleic 
acid fragments, 

(b) superior mass accuracy and resolution of nucleic acid fragments in this 
mass range, and 

(c) a multiplicity of mass peaks that can be used as internal self-calibration 
standards, further improving the mass accuracy. 

For analysis with MALDKfQF MS, the goal of fragmentation is tu 
produce a sd of nonrandom length fragments ranging in length from 2-300 bases, 
preferably from 10-100 bases in length. The range of lengths serves to better 
separate and resolve the fragment peaks in the resulting mass spectrum. 

Fragmentation of target nucleic acids larger than 100 bases in length can be 
accomplished using a number of means, including cleavage with one or more 
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1>NA restriction eixionucleases targeting specific sequences within double-stranded 
DMA, chemical cleavage at stnKaure-specific and/or base-specific locations, 
polymerase incorporation of modified nucleotides that create cleavage sites when 
incorporated, arid targeted structure-specific and/or seque ace-specific nuclease 
treatment. 

An exemplary case is where a larger target nucleic acid* e.g. 500 bases in 
length, is nonrandom] y fragmented to produce 10 to 30 nonrandom length 
fragments dwt can all be individually resolved by MALDI-TOF mass 
spectrometry. Two different nonnuidom length fragments having the same number 
of bases can still be resolved from each other by mass spectrometry when they 
differ in base composition and consequently in mass. Gel electrophoresis methods 
typically cannot resolve equivalent length fragments. 

For example, for a 5 kilobase pair (kb) target nucleic acid to be fully 
analyzed , using nonrandom length fragments with an average size of 30 bases, 
approximately 170 nonrandom length fragments would need to be screened. 
Typically, the target nucleic add would be amplified by a number of DNA 
amplifications, ~ 10-20, in order to reduce the number of fragments to be 
analyzed in any given sample. Each amplified target nucleic acid product would 
be digested using restriction endonuc leases, often with four-base recognition sites 
to produce the optimal size fragments. It is preferable that the fragments vary in 
size to simplify the mass spectral data, e.g, 32 bp + 28hp+27bp-r37bp4 

although, as stated above, nonrandom length fragments of the same size could 
potentially be analyzed if their base compositions vary enough to minimize spectral 
overlap. 

A schematic of the process along with a hypothetical mass spectrum is 
shown in FIG. 2. FIG. 2 illustrates a 161 base target nucleic add that has been 
PCR amplified and fragmented using restriction eodonucleases. The resulting 6 
nonrandom length fragments arc produced. When the laser desorption process 
occurs, during MALD1-TOF mass spectrometry analysis, the 6 double-stranded 
fragments are mostly denatured and the resulting 12 single-stranded nonrandom 
length fragments are ionized and detected. Shown at the bottom of FIG. 2 is a 
simulated mass spectral data plot with all the mass peaks resolved. 
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As can be seen in FIG. 2 h is very common that restriction endonuclcase 
treatment will produce a number of complementary fragments with the fame 
number of bases, e.g. two at 19 and two at 32. The presence of these equaJ- 
length fragments places higher constraints on the required resolution for 
distinguishing all of the different peaks. It is a!so not uncommon for the two 
equal-length, complementary fragments to have identical or nearly identical mass 
values, leaving rhe possibility that two complementary fragments will not be 
resolvable. 

Often samples will be heterozygous, containing a 50% mixture of both the 
normal wild type nucleic acid and the mutated target nucleic acid. In the case 
where the target nucleic acid carries a mutation in a heterozygous mix, one would 
observe a splitting of peaks within the nonrandom length fragments containing the 
mutation. An example of this splitting is shown in FIG. 3 where an A-T to T-A 
transversion or base flip has occurred in one copy of the gene. The expected 
peaks would be half normal height since their concentrations are halved relative to 
homozygous concentrations. In this case, the difference between mutant and wild 
type peaks would be -9 Da which can be resolved in the 32 base long fragment. 
The presence of wild type peaks provides internal sdf-calinranti allowing highly 
accurate mass differences (as opposed to absolute mass) to be used to determine 
the base composition change. 

The methods described herein permit MALDI-TOF MS analysis of 
nonrandom length fragments which has a mass accuracy of approximately 1 pan in 
10,000. The use of internal sclf-calibrants makes it possible to extend this level of 
accuracy up to and potentially beyond 30,000 Da or 100 bases. This mass 
accuracy enables exact sizing of nucleic acid fragments and die determination of 
the presence and nature of any mutation, including point mutations, insertions and 
deletions, even in a heterozygous environment. Further described herein arc 
methods for improving the resolution of individual fragments by means including 
elimination of equal-length complementary pairs through the use single-straud- 
targeted fragmentation and/or isolation procedures, and the incorporation of mass- 
modified nucleotides to enhance the mass difference between similar sized 
fragments and/or mutant and wild type fragments. In addition, these methods 
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provide for the removal of sales and other deleterious materials als well as a means 
for the removal of unwanted nucleic acid fragments prior to mass spectroscopic 
analysis. 

Mass RraoumoN, Mass Accuracy, and tok lists ok Mass-Modified 
Nucleotides 

Any of the embodiments of the invention described herein optionally 
include noitrandom length fragments having one or more nucleotides replaced with 
mass-modified nucleotides, wherein said mass-modified nucleotides comprise 
nucleotides or nucleotide analogs having modifications that change their mass 
relative to die nucleotides that they replace. The mass-modified nucleotides 
incorporated into the nonrandom length fragments of the invention must be 
amenable to the enzymatic and nonenzymatic processes used for the production of 
nonrandom length fragments. For example* the mass-modified nucleotides must 
be able to he incorporated by DNA or RNA polymerase during amplification of 
the target nucleic acid. Moreover, the mass-modified nucleotides must not inhibit 
the processes used to produce nonrandom length fragments, including, inter alia % 
specific cleavage by restriction endonucleases or structure-specific endonucleases 
and digestion by single-strand specific endonucleases, whenever such steps are 
used. Mass*mod ifications can also be incorporated in the nonrandom length 
fragments of the invention after the enzymatic steps have been concluded. For 
example, a number of small chemicals can react to modify specific bases, such as 
kethoxal or formaldehyde. 

Any or ail of the nucleotides in the nonrandom length fragments can be 
mass-modified, if cecessary , to increase the spread between their masses. It has 
been shown that modifications at the C5 position m pyriroidines or the N7 position 
in purines do not prevent their incorporation into growing nucleic acid chains by 
DNA or RNA polymerase. [L. Lee et al. "DNA Sequencing with Dye-Labeled 
Terminators and T7 DNA Polymerase: Effect of Dyes and dNTPs on 
Incorporation of Dye-Terminaiors and Probability Analysis of Termination 
Fragments" Nuc. Acids. Res. 2Q. 2471 (1992)] For sample, an octynyl moiety 
can be used in place of methyl on thymidine to alter the mass by 94 Da. 
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Mass-modifying groups can be, for example, halogen, alkyL ester or 
polyester, ether or pulyether, or cf the general type XR, wherein X is a linking 
group and R is a mass-modifying group. The mass-modifying group can be used 
to introduce defined mass increments into the nonrandom length fragments. One 
of skill in the art will recognize that there are numerous possibilities for mass- 
modifications useful in modifying nucleic acid fragments or oligonucleotides, 
including those described in Oligonucleotides and Analogues; A Practical 
Approach, Eckstein ed. (Oxford 1991) and in PCT/US94/00I93, which are both 
* incorporated herein by reference. 

At larger mass ranges (30,000-90.000 Da), the mass resolution and mass 
accuracy of current MALDI-TOF mass spectrometers will not be sufficient to 
identify a single base change. For this reason, it may be preferable to increase the 
useful mass range artificially by substituting standard nucleotides within either a 
target nucleic acid or a nonrandom length fragment with mass-modified nucleotides 
having significantly larger mass differentials. Use of mass-modified nucleotides 
applies as well to the mass range below 30.000 Da. Mass modification can 
generally increase the quality of the mass spectra by enlarging the mass differences 
between NLFs of similar size and composition. For example, mass-modified 
nucleotides can increase the minimum mass difference between two nonrandom 
length fragments that are identical in base composition except for a single base 
which is an A in one NLF and is a T in the other. Normally, these two NLFs will 
differ in mass by only 9 Da. By incorporating a single mass-modified nucleotide 
into one of the bases, the mass difference can be >20 Da. The spectra in FIG. 4 
depict the influence mass-modified nucleotides can have on fragment resolution. 
One example of the many possible mass modifications useful in this invention is 
the use of 5-(2-hcptynylV-deoxyuridinc in place of thymidine. Tbe replacement cf 
a methyl group by heptynyt changes the mass of this particular nucleotide by 65 
Da. An A to T trans version in a nucleic acid fragment in which alt thymidine 
bases have been replaced with 5-(2-bcptynyl)-deoxyuridine would produce a peak 
shift of 56 T>a us opposed to 9 Da for the same nucleic ackl fragments without the 
mass-modified nucleotides. The use of mass-modified nucleotides is especially 
important in the analysis of NLFs derived from RNA. Normally, the masses of C 
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and U vary by only 1 Da, making it practically impossible to delect C 10 U or U 
to C point mutations within a given fragment. 

Benefits of Analyzing Singue>Stranded Nucleic acids 

5 The goal of this invention is the accurate determination of the masses of a 

set of resolved nonrandom length fragments and correlation of this data to the 
characterization of any mutation, if present. The embodiments of this invention 
include mass spectrometry determination of masses of the members of a set of 
Single-stranded nonrandom length fragments as well as mass determination of the 

10 members of a set of mass-modified, double-stranded nonrandom length fragments. 

The preferred embodiment is to detect mutations in a target nucleic acid 
comprising obtaining a set of nonrandom length fragments in single-stranded form, 
wherein the single-stranded nonrandom length fragments are derived from one nf 
either the positive or the negative strand of the target nucleic acid or where the set 

15 is a subset of fragments derived from both the positive and the negative strands of 

the target nucleic acid. The examples of single-stranded methods described herein 
focus on fragments derived ftum the positive strand. 

FIG, 2 and 3 illustrate that each double-stranded nonrandom length 
fragment, comprising two complementary strands, produces two peaks in the mass 

20 spectrum corresponding to die denatured single strands. The additional peaks 

from double-stranded nonrandom length fragment* as compared to single-stranded 
nonrandom length fragments add to congestion of mass peaks in Che mass spectra, 
as well as introducing the possibility chat it may be extremely difficult, if not 
impossible, to resolve the complementary fragments if they have nearly or exactly 

25 identical base compositions. Furthermore, some portion of die double-stranded 

nonrandom length fragments do not fully denature, and mass peaks corresponding 
to Che double-stranded products increase the spectral congestion. 

Because spectra using both strands contain a two-fold redundancy in data, 
since any mutation in one strand will be present within its complement, it is 

30 reasonable to remove one strand prior to mass spectrometry analysis and still 

produce all of the data necessary for complete mutation analysis. For these 
reasons, it is the preferred embodiment to analyse a set of single strands where 
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only one of tbe two complementary sets nucleic acid fragments representing the 
fill] target sequence is used. 

FIG, 5 shows the expected spectrum if only Ihe nonrandomly fragmented 
positive strand of a target nucleic acid from FIG. 3 is analyzed by mass 
spectrometry. Analysis of one of the two complementary strands of the double- 
stranded nonrandom length fragments halves the number of expected peaks within 
the mass spectra, allowing more total fragments to be resolved and the possibility 
that longer total sized target nucleic acids can be analyzed at one time. Removal 
* of one of the two strands from each nonrandom length fragment eliminates the 
greatest source of complication for each spectra. A number of methods for 
isolating and preparing both single-stranded and double-stranded nonraitdom length 
fragments for mass spectrometry are described herein. 

Methods of Nonrandom Fragmentation of T argot Nucleic Acids 

The methods of the invention all involve obtaining from a target nucleic 
acid a set of resolvable, nonrandom- length fragments and determining the mass of 
the members of thai set using mass spectrometry without sequencing the target 
nucleic acid. All of the methods described herein involving mass spectrometry 
include inter alia two types of mass spectrometry, electrospray ionization (ESI) 
and matrix-assisted laser desorptiony ionization time-of-fltght (MALDI-TOF). In 
addition to tbe restriction endonuclea&e approach to nonrandomly fragmenting a 
target nucleic acid, there are a number of other approaches which arc described 
below. 

Nonrandom Fragmentation using Restriction Site probes 

Target nucleic acid can be nonrandomly fragmented using hybridization to 
nucleic acid, restriction site probes followed by cleavage with one or more 
restriction endonudeascs the recognition sequences of which are contained in the 
restriction site probes used. "Restriction site probes* are oligonucleotides that 
when hybridized to single-stranded target nucleic acid at specific sequences form a 
complete double-stranded recognition site cleavablc using restriction 
endonucleases. Tbe use f restriction site probes is illustrated in FIG. 6* 
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Oligodcoxynuckotide and Enzyme Moieties, " Gene 40 f 169 (1985) (incorporated 
by reference herein}] These universal restriction pmtes comprise wo regions, tbe 
first region being single-stranded and complementary to a specific sequence within 
the target nucleic acid, and the second region being double-stranded and containing 
5 the restriction recognition site for a particular class IIS restriction endonuclease. 

Class US restriction endonucieases cleave double-stranded DNA at a specific 
distance from their recognition sequence. By using this property, and the 
universal restriction site probe design, it is possible to nonrandomly fragment a 
single-stranded DNA target at virtually any sequence, providing die means to 

10 better control the selection of fragment sizes. It is also possible to mix standard 

restriction site probes and universal restriction probes in a single reaction. 

In this approach, a positive single-stranded target nucleic acid is hybridized 
to one or more restriction site probes that are complementary to one or more 
restriction endonuclea.se recognition sequences within the target nucleic acid. Upon 

IS hybridization of die restriction site probes to the target nucleic acid, hybridized 

target nucleic acids are formed, comprising double-stranded regions where the 
restriction site probes have hybridized to the target nucleic acid and at least one 
single-stranded region where the target nucleic acid remains unhybridized to a 
restriction site probe. The double-stranded regions of the hybridized target nucleic 

20 acids are recognition sites for cleavage by one, two or more restriction 

endonucleases. After die formation of hybridized target nucleic acids, the 
hybridized target nucleic acids are digested with one, two or more restriction 
endonucleases, the recognition sequences of which are contained within the 
double-stranded regions. 

25 The resulting nonnmdotn length fragments have at least one cleaved 

restriction site oligonucleotide probe annealed. In some cases, these cleaved 
probes will be of a size too small to remain hybridized to the target fragments. 
These nonrandom length fragments can either be purified with the cleaved 
restriction site oligonucleotide probes attached, or the NLFs can be purified from 

30 the cleaved oligonucleotide restriction site probes. Roth types of purification can 

be accomplished using a variety f techniques known in the art, including 
filtration, precipitation, or dialysis. The preferred approach is to capture the 
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NLFs to a solid support. The set of nonrandom length fragments can be directly 
captured (o a solid support themselves using a number of means including a 
binding moiety such as biotin incorporated at numerous base positions throughotit 
the NLFs. Or the NLFs can be indirectly capmral to a solid support via 
5 hybridization to one or more capture probes that is itself bound to a solid support. 

The capture probe can comprise the full-length strand of the target nucleic acid 
that is complementary tn the strand from which the nonrandom length fragments 
were derived. Alternatively, the capture probes can be a set of capture probes 
each containing at least one sequence complementary to said nonrandom length 

10 fragments. 

By combining an asymmetric amplification method to produce single- 
stranded target nucleic acids with the use of restriction site probes, as described 
herein, one can produce predominantly ihe desired set of single-stranded NLFs. 
The restriction site probes used to produce the recognition sites may cupurify with 

15 the NLFs but can be designed so that they do not interfere with the majority of the 

mass spectra. For example, die restriction site probes can be designed so that 
after cleavage their final sizes are less than 20 bases in length and the nonrandom 
length fragments can have sizes in the range of 20 to 100 bases. 

The methods described above can also be modified with the use of 

20 uncleavable restriction probes. These uncleavable probes, synthesized with a 

restriction endonuclease resistant backbone such as phosphorothiuaie, 
boranophosphate, or methyl phosphorate, can be used to keep the target nucleic 
acid NLFs tethered together following restriction digest and can provide a different 
approach to purification of the NLFs. 

25 

Fragmentation Using Fragmenting Probes and SiNtiLE-SrRANi>-Spis:ci*ic 
Cleavage 

While the use of restriction endomicleases in various combinations and hi 
multiple digests can be an effective approach to fragmentation of the target nucleic 
30 acid, when a target presents long sequence lengths ( > 100 bases) that do not 

contain any restriction sites, alternative nonrandom fragmentation techniques arc 
preferred. Long > 100 base fragments will be difficult to probe with sufficient 
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mass accuracy to determine if a base change mutation has occurred. One way to 
control the size nf fragments is through the use of fragmenting probes and single- 
strand-specific endonuc leases . 

Fragmenting probes are defined as nonrandom length, single-stranded 
S oligonucleotides complementary to selected regions of a single-stranded target 

nucleic acid, and are used through hybridization to define and differentiate within 
the target nucleic acid regions thai are double -stranded versus regions that remains 
single-stranded. Following differentiation by hybridization the single-stranded 
regions are subjected to cleavage. As is die case for all of the methods described 

10 here that utilize oligonucleotides, the fragmenting probes may be comprised on 

DNA, RNA or modified forms of nucleic acid such as phosphorothioatcs, methyl 
phosphorates or peptide nucleic acids. Three examples of singie-strand-specific 
nucleases that can be used in these methods are Mung bean nuclease. Nuclease Si, 
and RNase A. These enzymes cut single-stranded DNA or RNA exclusively and 

15 act as both exo~ and cndonucleases. 

An example of how these probes and enzymes are used follows. A set of 
fragmenting probes of defined size and sequence are designed to hybridize to 
complementary regions of the target nucleic acid. It is preferable that the target 
nucleic acid be primarily if not entirely single-stranded. Use of a T7 or SP6 RNA 

20 polymerase transcription system for final amplification is a simple approach to 

producing the required single-stranded target nucleic acid. Asymmetric PCR can 
also be utilized to produce primarily single-stranded target. 

FIG. 7 shows how different portions of the single-stranded target nucleic 
acid ate hybridized to the oligonucleotide probes. Following hybridization, any 

25 regions of the target nucleic acid that remain single-stranded are cleaved using a 

singie-strand-specific cndo/exonuclease, such as SI Nuclease, Mung bean 
nuclease, or RNase A. The size of the single-stranded region can be as small as a 
single phosphodiescer bridge, i.e. the phosphodlestcr bond across from a nick. SI 
nuclease is capable of cleaving across from nicks. The end products ore double- 

30 stranded hybrids comprised of two equal length strands: one strand is a member of 

the set of nonrandom length fragments derived from the target nucleic acid and the 
other strand is a member of the set of fragmenting probes, wherein said NLFs are 
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hybridized lo said fragmenting prohes. Either these double-stranded hybrids or 
isolated single-stranded nonrandom length fragments derived from said target 
nucleic add can be used for MALDI-TOF mass speclromelrie analysis. 
Preferably* the analysis of the single-stranded nonrandom length fragments derived 
5 from said target nucleic acid provides a simpler mass spectrum. It should be 

noted that when the complementary strands are a mixed DNA/RNA hybrid there 
win be a significant mass difference between the two strands in all cases , making 
each strand more easily resolvable in the mass spectrum. 

Unlike the restriction endunucleasc nonrandom fragmentation approach, 

10 with this method it is possible to use a DNA/RNA hybrid providing a convenient 

route toward digesting the fragmenting probes after nonrandomJy fragmenting the 
target nucleic acid. Isolation of the set of NLFs from the set of fragmenting probes 
is another means to simplify the mass spectra. Because of the different chemical 
nature of the two strands of the hybrid, it is possible to utilize DNA- or RNA- 

15 specific enzymes to digest the fragmenting probes. As an example, DNase can be 

used to digest fragmenting probes comprised of DNA while leaving nonrandom 
length RNA fragments intact or RNase can be used to digest RNA probes while 
leaving nonrandom length DNA fragments intact, lc is also possible to utilize 
different chemistries to specifically digest one strand or the other. These 

20 chemistries include the use of acid to digest DNA or base to digest RNA as well 

as a multiplicity of other chemistries that can he use to cut modified versions of 
DNA or RNA. This differential cutting can be exploited to purify and analyze 
only one of die two strands as described in a later section. 

Thus, another embodiment of this invention is a method of detecting a 

25 mutation in a DNA fragment from a DNA/RNA hybrid nucleic acid comprising 

obtaining a DNA/RNA hybrid wherein the DNA/RNA hybrid comprises a single- 
strand of a DNA fragment hybridized to a single-strand of a RNA fragment, 
digesting the single-strand of RNA using a RNA-speciftc reagent, including RNase 
or a base, determining the mass of the single-stranded DNA fragment using mass 

30 spectrometry, and comparing said mass to a mass of a wild type single-stranded 

DNA fragment. Another embodiment is a method of detecting a mutation in a 
RNA fragment from a DNA/RNA hybrid nucleic acid comprising obtaining a 
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DNA'RNA hybrid wherein the DNA/RNA hybrid comprises a single-strand of a 
DNA fragment hybridized io a single-strand of a RNA fragment, digesting the 
single-strand of DNA using a DNA-specific reagent, including DNase or an acid, 
determining the mass of the single-stranded RNA fragment using mass 
spectrometry, and comparing said mass lu a mass of a wild type singie-stranded 
RNA fragment. These embodiments can aJso be applied to a set of DNA/RNA 
hybrids, and using the DNA-specific nr RNA-specific digestion to leave a set of 
nonrundum length fragments consisting of DNA fragments or a set of nonrandom 
length fragments consisting of RNA fragments. 

Complete digestion using restriction endonucleases produces a series of 
fragments that can be aligned end to end but do not overlap. With the use of 
fragmenting probes and single-strand -specific cleaving reagents described herein, 
one can design a set of sequence and size specific fragmenting probes thai can be 
used to produce a set of nonrandom length fragments such that one or more 
members of the set comprise a nonoveriapping nucleotide sequence and a 
nucleotide sequence that overlaps with a nucleotide sequence of another member of 
the set. The example shown in FIG. 7 uses a set of sequence and size specific 
fragmenting probes that overlap (e.g. split Into two seta of hybridization reactions) 
to produce an overlapping set of nonrandom length fragments. The set of 
nonrandom length fragments that overlap could be nested. By using a set of 
overlapping nonrandom length fragments to screen for a mutation, one can more 
narrowly localize the region containing a mutation. If two overlapping 
nonrandom length fragments both contain die mutation, as is the case in FIG. 7, it 
is then known that the mutation exists within the small region of overlap. 
Conversely t if only one of the overlapping fragments contains a mutation, it is 
known that the mutation cannot be in an overlapping region. This approach plus 
the ability to design certain fragmenting probes to be very small in size, e.g. 10 to 
20 bases (typical fragmenting probes win be anywhere between 10 and 100 bases 
in length), allows one to probe genetic regions that are known hot spots for 
mutation with greater detail. 

One variant of this method is to use single^strand-spccific chemical reagents 
as a means for cleaving a target nucleic acid target into a set of nonrandom length 
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fragments. Several base-specific cleavage chemistries have been identified that 
cleave the nucteic acid backbone at base-specific sites that are single stranded and. 
under optimal conditions, demonstrate zero or extremely reduced cleavage levels 
at base-specific sites that are double-stranded. As an option the target nucleic acid 
5 can be synthesized using one or more modified nucleotides in order to make the 

backbone more vulnerable to chemical cleavage. By using fragmenting probes to 
hybridize to a target nucleic acid at all sites except the specific locations where 
cleavage is desired* it is possible to limit cleavage to these single-stranded sites 
and create a sequence-specific set of nonrandom length fragments. The method, 

10 schematized in FIG. 8. can utilize one of a number of different chemistries that 

are known to be single-strand specific including hydrogen peroxide cleavage 
and/or 2-hyd«)peroxyteirahydrofuran cleavage at C. fP. Richterich et al. 
"Cytosinc specific DNA sequencing with hydrogen peroxide" Nuc. Acids Res. 23, 
4922 (1995); G. Liang, P. Gannet & B. Gold "The Use of 2- 

15 Hydroperaxytetrahydrofuran as a Reagent to Sequence Cytosine and to Probe Non- 

Watson-Crick DNA Strictures" Nuc, Acids Res, 23, 713 (1995)}. Target nucleic 
acids that contain cleavage-modified nucleotides can be made by incorporation of 
modified nucleotide triphosphates during an amplification or polymerization step. 
A second variant of this method is to create heterozygous hybrids between 

20 the wild type fragmenting probes aid die target nucleic acid. By using 

fragmenting probes comprised of wild lype sequence, any hybrids that form with 
mutant sequence containing a point mutation wfll create a base mismatch or bulge. 
If the mutation is a small insertion or deletion, a looped out sequence wfll occur. 
Witt) this heterozygous hybrid, it is possible to use one of the structure- specific 

25 enzymes or chemistries described in the following section to create a mutation- 
specific cleavage at the site of a mutation. An example of the pattern of 
nonrandom length fragments produced is shown in FIG. 9. This approach 
permits determination of the type and location of the mutation that has occurred. 
Also as win be described, performance of a mutation-specific cleavage relaxes die 

30 mass accuracy and resolution constraints, thus increasing the useful size range for 

the nonrandom length fragments to be analyzed with MALDI-TOF mass 
spectrometry to a range of several hundred bases. 
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Mutation-Specific Cleavage Using Structure-Specific Endonuoxases 
Another nonrandoin fragmentation technique involves the use of mutation- 
specific cleavage ac base mismatch regions, if present* using structure-specific 
endonucleases or single-suand-specific dcavajre. Creation of mismatch regions 
requires hybridization between a mutation containing, single-stranded target 
nucleic acid and a set of one or more single-stranded complementary wild type 
probes derived from wild type sequence. Wild type probes can be restriction site 
probes, fragmenting probes, or capture probes comprising wild type nucleotide 
* sequence that when hybridized to a complementary mutation-containing region of a 
target nucleic actd results in a base mismatch bulge or loop structure. A base 
mismatch will be created at the location of the mutation. In one embodiment, the 
mutation containing positive strand is hybridized to u complementary wild-type 
probe that comprises the entire negative strand, tn the preferred embodiment, the 
complex of mutation containing positive strand hybridized to one or more 
complementary, wild type nucleic acid probes is fragmented using either 
restriction endonucleases, or fragmenting probes coupled with a single-strand- 
specific cleavage reagent. Any base mismatch regions between the set of wild 
type probes and the set of NLFs can be specifically cleaved using ouc or more 
mismatch-specific cleaving reagents. Examples of these reagents include: 
smicturc-specific endonucleases such as T4 cndonuclease VH, RuvC t MutY. or the 
endomiclcolytic activity from the 5 T -3* exomiclease submit of thermostable DNA 
polymerases, amgle-strand-specific enzymes such as Mung bean nuclease, SI 
nuclease or KNase A, and single-strand-specific chemistries such as 
hydroxy taurine, osmium tetraxide, potassium permanganate, or peroxide 
modification of unpaired bases followed by a backbone cleaving oxidation step, 

This mismatch-specific cleavage is used to cleave the mu tan on-containing 
oonrandom length fragment at the site of the mutation, thus producing two smaller 
fragments from the larger mutation-containing fragment. This approach is an 
efficient and simple way to identify die exact location of a mutation as well as its 
type. The mismatch-specific cleavage used in combination with one of the 
nonrandoin fragmentation methods described herein can be used to fragment a 
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large (> 200 bases), single-siramled target nucleic acid into a set of smaller, mass 
resolvable nonrandom length fragments. 

Like EMC and CCM, the mismatch-specific cleavage approach utilizes a 
mismatch targeting reagent to cut at the point of mutation. The approach 
5 described herein improves upon the gel eiecirophoreRis-hased methods by focusing 

on relatively small fragments thai lake maximum advantage of the mass 
spectrometer's ability to detect the exact size of a fragment leading to the 
identification of the exact location and nature of a mutation. The EMC and CCM 
methods must be followed by DNA sequencing in order to fully characterize a 

10 mutation. Using the methods described herein, a mutation in a target nucleic acid 

can be detected and its location and nature determined without any sequencing. 

An example of how a structure-specific enzyme like T4 endoroiclease VII 
can be used for mismatch-specific cleavage is shown in FIG. 10. The First step 
involves two amplification reactions. First, a target nucleic acid suspected of 

15 containing a mutation is amplified- Second, the corresponding wild type target 

nucleic acid is amplified to create wild type probes. These two amplification 
reactions can be performed together in one tube if the target nucleic acid is a 
heterozygous mixture of mutant and wild type, For certain diagnostic procedures, 
it may be more efficient to produce the wild type probes separately prior to the 

20 screening process. The next steps involve fragmentation of the target nucleic acid, 

e.g. a multiple digest of the target nucleic acid using more than one restriction 
endomiclease. and a step in which the fragments are mixed, denatured, and then 
annealed. The fragmentation and denaturing/ annealing steps can occur in either 
older. The purpose of the denaturing/anneal big step is to produce a mixture of 

25 hybrid target nucleic acids. In a 50:50 mixture of mutant target and wild type 

nucleic acids, four different products result: 25% homozygous mutant double- 
stranded nonrandom length fragments, 25% homozygous wild type double-stranded 
nonrandom length fragments, and 25% cadi of the two forms of heterozygous 
mutant/wild type hybrid nonrandom length fragments. See FIG. 10 (illustrating 

31} die use of wild type NLFs as wild type probes to generate a base mismatch with 

mutant NLFs). The heteioz.yguu* nonrandom length fragments contain at least one 
base mismatch at the site of mutation, i.e. the pnint(s) of sequence variation 
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between mutant and wild type. The next step involves treatment of flic nonrandom 
lengch fragments with a mismatch -specific reagent that cleaves at the site of the 
base mismatch in the heterozygous mutant/wild type nonrandom length fragments. 
These new cleavages (the number of cleavage events will depend on the particular 
enzyme used) typically reduce the nonrandom length fragment containing the 
mutation into two smaller nonrandom length fragments. The 50% of the mixture 
that contains the homozygous double-stranded nucleic acid fragments with no 
mismatches will not be cleaved during the mutation-specific cleavage. 

Example schematic mass spectral plots are shown in FIG. 10B. An 
expected spectrum would show a reduction in the peak size of the nonrandom 
length fragment containing the base mismatch that is cleaved by the structure- 
specific endonuclease (e.g. peaks 32+(Mut), 32+(Wt). 32-(Wt)« and 32-<Mut)) 
and the introduction of several smaller peaks at lower masses than the mutant 
peaks representing the set of heterozygous mutant/wild type NLFs that contain 
base mismatches (see peaks 8+(Mut), 8+(Wt), 11-, 2t-<Wt), 2HMut), and 
24+). These peaks corresponding to the heterozygous NLFs containing base 
mismatches are reduced in intensity but continue to be present since only 50% of 
the molecules exist in the heterozygous form that can undergo the mutation- 
specific cleavage. 

It is possible to bias the population of the different 
heteruzygous/homozygous forms by performing the amplifications of the target 
nucleic acid asymmetrically. Thus, one can maximize the types of nonrandom 
length fragments yielding mutational data with the majority of the duplex fanned 
during the annealing process bung heterozygous positive (+) strand mutant and 
negative (-) strand wild type. 

While it is possible to observe similar patterns using gel electrophoresis 
techniques, the mass accuracy obtained by mass spectrometry provides the 
advantage of accurate determination of the nature nf the mutation and the ability to 
determine the size and older of the two nonrandom length fragments created by 
the mutation-specific cleavage. In the example in FIG. 101), the resulting 
mismatch- specific cleavage fragments arc represented by sizes 8, 11, 21, and 24 
nucleotides in length. Using electrophoretic techniques, it would be impossible to 
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differentiate the two mutant forms at 8 and 21 (fragments 24+ and 12- do not 
posses:; the mutant base and are identical in heterozygous forms C and D), nor 
would it be possible lo directly determine which fragment is upstream {toward the 
5' end) and which fragment is downstream (toward the 3' end), e.g. in the positive 

5 strand it is 8+ that is upstream from 24+ . By providing exact mass values, 

mass spectrometry allows these strands to be ordered based on mass value 
database comparison wilt the fragments expected from the known sequence of the 
wild type target nucleic acid. By completely identifying the location and nature of 
the mutation this mass spectrometry method eliminates any need for sequencing 

10 the target nucleic acid. 

FIG. 10B shows how the mismatch-specific cleavage event adds complexity 
to the mass spectra. In the example shown, there are several locations where 2, 3, 
and even 4 different NLPs have the potential to overlap in the mass spectrum, 
making the full spectrum difficult to resolve. As discussed previously, and shown 

15 in FTG. 5, the mass spectra can be greatly simplified by performing the mass 

spectrometry analysis on only flic + or the - strands of the nonrandom length 
fragments. For example, FIG. 1 1 shows the set of nonrandom length fragments 
that are derived by analyzing only the + positive strand of the mutant target 
nucleic acid. By eliminating the homozygous nonrandom length fragments that are 

20 not mutation-specifically cleaved and removing the negative strand from the mass 

spectrometry analysis, the total number of nonrandom length fragments to be 
analyzed can be reduced from 20 to 7, with no two mass peaks having the same 
number of nucleotides. Of course, m other situations, two peaks may be from 
nonrandom length fragments of the same length depending on the type of mutation 

25 present, but such situations will be infrequent 

This mismatch-specific cleavage, like the incorporation of mass-modified 
nucleotides, extends die usable mass range of the initial target nucleic acid for 
mass spectrometry analysis since the primary mass accuracy needs are in 
determining the reduced mass of the nonrandom length fragments created by die 

30 mutation-specific cleavage and not in determining <he mass of the other nonrandom 

length fragments that are unaffected by the mutation-specific cleavage. 
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It is not always necessary to fragment the target nucleic acid in tandem 
with mismatch-specif c cleavage if the size of the nonrandom length fragments 
created by the mismatch-specific cleavage is small enough ic> fall into the usable 
mass range with the necessary mass resuluhon and accuracy. Target nucleic acids 
as large as 200 base pairs will yield at least one nonrandom lengch fragment 
created by die mutation-specific cleavage wherein the nonrandom length fragments 
can be a size less than 100 base pairs, e.g. a 200 hp tai^er nucleic acid with 9 
mutation at position 135 will produce nonrandom length fragments of 65 and 135 
* after cleavage at the sice of base mismatch. 

Fragmeintation Using SmrcrirRE^reciFic Enponhclkases r ro Clkave a 
folded Target Nucleic Acid 

Another nonrandom fragmentation method of the invention involves 
providing a target nucleic acid that is either a positive or a negative single-strand: 
providing conditions permitting folding of the single-stranded target nucleic acid to 
form a three-dimensional structure having intramolecular secondary and tertiary 
interactions, and nonrandomly fragmenting the folded target nucleic acid with at 
ieaat one structure-specific endoirac lease to form a set of single-stranded 
nonrandom length fragments. A diagram of this procedure is provided in FIG. 12. 
Aq example of conditions that permit folding of the single-stranded target nucleic 
acid are beating 10 denaturation followed by slow cooling to permit annealing to 
form a thennodynamically favored secondary and tertiary structure. The 
structure-specific endomicleases include: T4 endonuciease VII, RnvC, MutY, and 
the eudonucleolytic activity from the 5*-3* exonuclcase subunit of thermostable 
DNA polymerases. 

An alternative Co the use of structure-specific endonucleases is the use of 
some of the same single-strand-spectfic chemical cleavage procedures describe 
earlier in the text. Because of the higher frequency with which these reagents 
might cleave relative to the structure-spec iftc endonucleases, it is necessary that 
the secondary and tertiary structures formed by the single-stranded target be more 
compact limiting the access of the chemical reagents to the various reactive 
nucleotides. Approaches to forming these more compact structures include 



WO 97/33000 PCT/&S97/03499 

43. 

and proteins. For mass spectrometry analysis, prior an purification methods such 
as precipitation combined with vigorous alcohol washes, filtering and dialysis, and 
ion exchange chromatography are unsatisfactory because they cannot eliminate 
unwanted nucleic acid fragments and normally do not remove all salts from a 
sample. Solid phase approaches such as glass bead capture under high salt 
conditions, biotin/streptavidin binding, direct solid-phase covalent linkage* and 
capture via hybridization to solid phase bound oligonucleotide probes can be used 
tn eliminate unwanted nucleic acid fragments but typically require high levels, of 
salt during many of die wash seeps, rendering the products less pure and 
compromised for mass spectrometry analysis. 

The purifications mechods of the present invention are better suited to mass 
spectrometry analysis of nucleic acids than Che prior art methods. First, the 
methods herein physically isolate selected sees of nucleic acids from a multiplicity 
of impurities including undesirable nucleic acid fragments, proteins, salts, that 
would result in a poor quality mass spectrum. Second, the meihods optionally use 
a solution comprising volatile salts such as ammonium bicarbonate, dimethyl 
ammonium bicarbonate or trimethyl ammonium bicarbonate in any of the steps, 
including hybridization, endnnucJease digestion or washing. These two differences 
are significant advantages over the prior an because: (1) physical separation uf the 
desired set of nucleic acid fragments for mass spectrometries analysis is better than 
the labelling methods of the prior art that do not physically separate the target 
nucleic acids from a variety of other impurities that interfere with an accurate 
mass spectrum; and (2) the use of volatile salts in any of the steps precludes die 
need for any wash step known in the prior an to merely remove salts or inorganic 
ions. 

Double Strand Fragment Capture Approaches 

There are a number of basic ways to purify DNA restriction products from 
salts and other small molecules including precipitation, filtering, dialysis, and ion 
exchange chromatography. While all of these methods are effective, they are not 
all equally usseful for removing amplification primers, residual DNA, i.e. genomic 
DNA, or any proteins used. In addition, none of Uie bask: approaches meets all of 
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the requirements of automation speed and cose. The approach that comes closest 
is the use of small ton exchange spin columns, which arc somewhat expensive and 
not simple lo integrate into an automated setup. These small ion exchange spin 
columns can, however, produce high quality nucleic acids for mass speeirometric 
5 analysis. A betteT alternative is the use of (magnetic) glass beads to 

capture/precipitate nucleic acids of a specific size range and allow them to be 
rigorously washed. However, this method, like all of the other prior art methods 
described above, does not allow lor the removal of unincorporated DNA primer 
'since they are of the same size as the oonrandom length fragments to be analyzed 

10 and cannot be simply differentiated. 

Another general approach to purification of double-stranded fragments is to 
directly capture the target nucleic acid and/or a set of nonrandom length fragments 
by one of three means: (A) hybridization to capture probes comprising a first 
binding moiety that specifically binds to a second binding moiety attached to a 

15 solid phase: (B) binding the target nucleic acid or the members of the set of NLFs 

each comprising a nucleotide sequence and a first binding moiety to a second 
binding moiety attached to a solid phase; or (Q direct covalent attachment of the 
target nucleic acid or the members of the set of NLFs to the solid support. Each 
of these methods has advantages nod disadvantages. 

20 (A) Hybridization to solid support bound capture probes is straightforward, 

specific, and can he made thennodynamicaliy and kineiically favored by 
optimizing the size and concentration of the capture probes. Optimization is 
necessary since die set of NLFs would generally prefer to hybridize to their 
complements rather than to the capture probes. (This approach also works well 

25 for single-strand isolation as described in the following section,) A variation is lo 

bind the probes to the solid phase after hybridization to target Both 
biotin/streptavklin and covalent approaches for linking the probes to the solid 
phase are feasible. The principal concern with this approach is that maintenance 
of the hybridization, especially during wash steps, requires relatively high level of 

30 salts and makes it more difficult to produce a salt-free product for mass 

spectrometry analysis. Solutions to this probkm include the use of relatively long 
capture probes to increase melting temperatures or the use of volatile salts that can 
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be removed prior to mass spectrometry analysis. The use of volatile salts is 
described in mote detail elsewhere. 

(B) Biotin coupling to stneptavidin (or avidin) requires that any Target 
nucleic acid or nonrandom length fragment to be captured contain a biotin. It is 
straightforward to capture the target nucleic acid because biotinylated primers can 
be used in the PCR amplif icaiiou. In order to capture all of the fragments after a 
restriction digest, it is necessary to incorporate biotin into all of the fragments. 
Three possible routes for biotin labeling are, (1) the inclusion of a biotinylated 

' nucleoside triphosphate during fragment synthesis (2) the use of a DNA 
polymerase to fill in at 5' restriction overhangs using a bioLinylated nucleoside 
triphosphate, and (3) the use of ligase to iigaic a biotinylated oligonucleotide at the 
restricted ends of the nonrandom length fragments, where the oligonucleotides »re 
either complementary to the restriction sequence overhangs or are capable of blunt 
end ligation. 

Each of the three approaches have their problems but are feasible. Biotins 
incorporated in method (I) may inhibit the restriction endonucleases to be used and 
prevent the use of structure-specific nucleases in a second mutation-specific step 
since the biotin may be recognized as DNA modifications to be excised. Method 
(2) Is more feasible but requires a preliminary cleanup step ro exchange the normal 
triphosphates for biotinylated ones. Restriction sties are limited to enzymes that 
produce 5' overhangs. Method (3) is more gencraiizabte than (2); its principal 
weakness is competition with larger fragments thai will want to relegate. 
However, this competition can be overcome by using an excess of the biotinylated 
linkers. 

(C) Hie approach of direct covalenl attachment of NLFs or target to a solid 
support faces many of the same challenges as the biotin/streptavidm approach but 
also includes the need to design specific. * hot* (i.e. fast and efficient) binding 
chemistry working with low concentrations of material. 

The target or members of a set of NLFs can be covalemly attached to a 
solid support using any of the number of methods commonly employed in the art 
to immobilize an oligonuclcoude or polynucleotide on a solid support. The target 
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or NUs covaleiuly attached to the solid support should be stable and accessible 
for base hybridization. 

Covalenl attachment of tlie target or NLFs to the solid support may occur 
by reaction becwecn a reactive site or a binding moiety on the solid support and a 
5 reactive site or another binding moiety attached to the target or NLFs or via 

intervening linkers or spacer molecules, where the two binding moieties can react 
to form a covalem bond. Coupling of a target or NLF to a solid support may be 
carried out through a variety of covaient attachment functional groups. Any 
suitable functional group may be used to attach the target or NLF to the solid 

10 support, including disulfide, carbamate, hydrazone, ester, N-funciionalized 

thiourea, functional hzed maieimide, streptavidin or avidin/biotin, meicuric-sutfide, 
gold-sulfide, amide, thiolester, uio> ether and amino. 

The solid support may be made from the following materials: cellulose, 
nitrocellulose, nylon membranes , controllcd-porc glass beads, acrylaraide gels, 

15 polystyrene, activated dexiran, agarose, polyethylene, funciionalizcd plastics, 

glass, silicon, aluminum, steel f iron, copper, nickel and gold. Some solid support 
materials may require functkmulization prior to attachment of an oligonucleotide or 
capture probe. Solid supports that may require such surface modification include 
wafers of aluminum, steel, iron, copper, nickel, gold, and silicon. Solid support 

20 materials for use in coupling to a capture probe include functionaltzed supports 

such as the 1,1 ^carhonyldiimidazoie activated supports available from Pierce 
(RockforcL, IL) or ftmctionalized supports such as those commercially available 
from Chiron Corp. (Emeryville, CA). Binding of a target or NLF to a solid 
support can be carried out by reacting a free ammo group of an amino-modificd 

25 target or NIP whh the reactive imidazole carbamate of the solid support. 

Displacement of rhe imidazole group results in formation of a stable N-alkyl 
carbamate linkage between the target or NLFs and the support. 

The target or NLFs may also be bound to a solid support comprising a gold 
surface. The target or NLFs can be modified at their 5'-end with a linker aim 

30 terminating in a thiol group, and the modified target or NLFs can be chemisorbed 

with high affinity onto gold surfaces (Hegner, et ■!., Surface Sci. 291:3*M6 
(1993b)). 



WO97O3W0 PCT/US97/0349* 

47. 

In all of the methods in which a solid-phase approach rs used, the double- 
stranded nonrandom length fragments can be rigorously washed to remove 
deleterious contaminants. Following washing it is necessary to release these 
fragments from tb* solid support for mass spectrometry analysis. The isolation of 
a set of NLFs may be performed on the same place thai is used within (he mass 
spectrometer. Both the capture probe hybridization and btotin/streptavidin 
approaches can use heat and/or pH denaturation to disrupt the noncovalem 
interactions and afford release of the set of NLFs bound to the solid support. 
* Alternatively, a cleavabic linkage can be incorporated between die first binding 
moiety and the NLFs. Any covalent coupling chemistry will need to he either 
reversible or it will be necessary to include a separate chemically cfeavable linkage 
somewhere within Che bound product. It may also be useful to use a chemically 
deavaWe linkage approach with the biotin/strepiavidin strategies so that release of 
the double-stranded fragments can be performed under relatively mild conditions. 
In all cases the cleavabic linkage can be located within the linker molecule 
connecting die biotin and the base (e.g.a disulfide bond in the linker), within the 
base itself (e.g. a more labile glycoside linkage), or within the phosphate 
backbone linkage (e.g. replacement of phosphate with a phosphoramidate). 

One alternative to these solid-phase approaches described above cs to 
capture the target nucleic acids prior to nonrandom fragmentation with one or 
more restriction endonucleases. Rigorous washes to remove polymerase, salts, 
primers and triphosphates required for amplification are followed by treatment 
with minima] amounts of restriction enzyme under very low salt conditions. This 
mixture is then directly analyzed in the mass spectrometer. Mass spectrometry 
can tolerate salts if their concentrations are low enough and a limited class of 
restriction enzymes can work under very low salt conditions. 

The low salt approach does limit the restriction sites that can be cleaved as 
part of the methods of detecting mutations. Many restriction endo nucleases 
require a significant level of salt. An attractive alternative to limiting the 
restriction end o nuclease cleavage reactions to low levels of salt is 10 replace the 
salts normally used with volatile salts. These salts, such as ammonium 
bicarbonate, duneftylammoniuin bicarbonate or trimethyl ammonium bicarbonate. 
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can be removed prior to mass specirumetric analysis through simple evaporation. 
Evaporation can be accelerated by placcmcnl of the sample in a vacuum, such as 
the mass spectrometer sample chamber, or by heating Che sampJe. 

5 Approaches to Captitring Single-Stranded Fragmlnis 

As described earlier, analysis ol single-stranded nonrandom length 
fragments is generally preferable since it provides a complete set of data with the 
minimal number of fragments and therefore simplifies the spectra and facilitates an 
Increase in the total length of nucleic acid that can be analyzed in a single assay. 

10 A number of approaches, as described above, can be taken toward the production 

of single-stranded fragments and their purification which includes the elimination 
of undesircd fragments. 

If DNA ttstriction endonncleases are used to produce the nonrandom length 
fragments, it is necessary that the target nucleic acid have a double-stranded form 

15 prior to restriction, or more specifically, that the restriction endonuclease 

recognition sites be located in double-stranded DNA. The alternative co having 
fully double-stranded DNA prior to restriction is to hybridize restriction site 
probes to single-stranded DNA, wherein the restriction site probes are 
complementary to the restriction sites for selected restriction endemic! eases . 

20 The basic known methods for DNA isolation - precipitation, dialysis, 

filtration and chromatography do not isolate single-stranded from double-stranded 
DNA. If these purification methods are employed it is necessary to add a separate 
step where single-strand isolation is performed. 

Isolation of a set of single-stranded NLFs can be accomplished using a set 

25 of capture probes. "Capture probes" arc oligonucleotides or polynucleotides 

comprising a single-stranded region complementary to at least one nucleotide 
sequence of the single-stranded NLFs to be isolated and a first binding moiety. 
The first binding moiety is capable of covalent or noncovalent binding to a second 
binding moiety attached to a solid support. The capture probes can comprise a set 

30 of capture probes, each of which contains single-stranded regions complementary 

to a corresponding member of a set of NLFa. A capture probe can also comprise 
a full-length single-stranded target nucleic acid that is complementary to the 



WO97/i3000 PCT/US97/03499 

49. 

nucleotide sequences of the members of a set of HLF.s. The capture probes can be 
bound to a solid support using the methods described above for binding a target or 
set of NLFs to a solid support. 

If restriction endonucleascs arc used to produce nonrandnm length 
fragments from DNA, the preferred method for isolating single-strand fragments 
from ihese products is to use a select set of capture probes. In one embodiment 
the capture probe consists of either fiijl length positive or full length negative 
strand where the strand lias been modified to contain a solid-phase binding moiety, 
* The process using full length negative strand modified to contain a blotin at the 5' 
end is illustrated in FIG. 13. The capture probe is made and the target nucleic 
acid is fragmented in two separate reactions. Following inactivation of the 
restriction enzymes the probe and double-stranded fragments are mixed, denatured 
and annealed producing a hybrid product of positive strand fragments annealed to 
full length negative strand capture probe. The capture probe can be bound to the 
solid phase via a biotin-streptavidin interaction prior to or following of the 
probe/fragment hybrid. Following the necessary wash steps the fragments are 
released and analyzed by mass spectrometry. Optionally, the fragments can be 
probed for a mutation-specific base-base mismatch and fragmented using one of 
the mismatch specific reagents described earlier. Illustrations of die different 
spectra produced without and with the optional second step are shown in FIG, 13. 
Note that after mutation-specific, mismatch-specific cleavage fragments that are 
distal from the solid phase binding site will be released into solution and washed 
away, therefore, not analyzed. Lose of these fragments can enhance the ability for 
mass spectrometry to quickly and easily identify the site of mutation. 

An alternative approach to using restriction endonucleascs is the use of 
fragmenting probes. These have been described in detail above, and allow the use 
of a target nucleic acid consisting of either DNA or RNA. The final ptvducts, 
using fragmenting probes and single-strand -sped fic nucleases, me double-stranded 
and thus without any additional steps do not themselves produce the set of single- 
stranded, noarandom length fragments necessary for analysis. However, there are 
several approaches that can be used to yield single-stranded nojirandom length 
fragments. 



WOWJ30UU PCT/USM03W 

SO. 

The first approach for producing single-stranded nonrandom length 
fragments is useful when the target is RNA and the probes are DNA or visa versa. 
In ihis case, the double-stranded produce arc RNA/OIMA hybrids and can be 
selectively treated with either a DNA or RNA specific nuclease lo yield the 
5 opposite NLF intact. Acid or base treatments are also an option. These single- 

stranded products can then be isolated using a number of conventional methods 
described above. 

A second approach to producing singJe-strandcd products for mass 
spectrometry Ls to attach the size and sequence specific capture probes to a solid 

10 support before or after hybridization to the target nucleic acid and the single- 

strand -specific cleavage. Since the probes are bound to the solid phase it becomes 
possible to capture, wash, and then selectively release the nonrandom length target 
fragments as single-stranded molecules. Following any wash steps, the nonrandom 
length target fragments are removed from the solid support by denaturation of the 

IS double-stranded complex. Once released, the single-stranded fragments can he 

directly analyzed by the mass spectrometer. 

One of skill in the art will know how to use capture probes to capture 
single-strands of a set of NLFs to a solid support in all the embodiments of this 
invention. For example biotmylated capture probes can be used to capture single- 

20 stranded fragments following cleavage of the target nucleic acid with restriction 

endoniicleases (optionally after neutralizing the restriction endoraicleases). The 
use of capture probes provides a relatively high level of flexibility to select winch 
set of NLFs to analyze at any given time. Large capture probes, capable of 
hybridizing to all or several different fragments, can be used to capture the 

25 fragments correlating to one strand of a target nucleic acid, e.g. a capture probe 

that is full length negative strand. A short capture probe or combinations of 
shorter capture probes can be used to selectively choose particular fragments from 
either strand to analyze in a given mass spectrometry sample. For example, if 
several fragments share similar sizes it might be preferable to analyze them 

30 separately. 

As another embodiment, a full length large I nucleic acid can be captured 
before restriction digestion using a capture probe that is nuclease resistant, lo this 
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case it is necessary 10 modify the capture probe, typically by changing the 
backbone composition from phosphate to a phospborothioatc, methyl phosphorate 
or botano-phosphace. [Uhlmann and Peyman, "Antisense Oligonucleotides: A 
New Therapeutic Principle, w Chemical Reviews 90{4):543-584 (1990) 
(incorporated by reference herein)] These forms of modification limit cutting on 
the probe strand, resulting only in (he nicking of the target molecule to create 
sequence-specific, nonrandom length fragments without creating any double 
stranded breaks. By leaving the modified probe strand intact, it is possible to 
* quickly capture the nonrandom length fragments to the .solid phase and purify for 
mass spectrometry analysis. 

All of these isolation or purification methods can be utilized in cases where 
a mutation-specific cleavage event is utilized. In order to present a base mismatch 
mutation for cleavage, a heterozygous, double-stranded molecule must be present. 
Typically this means that the fragmenting probe is composed of the wild type 
sequence and is hybridized to the target nucleic acid fragments containing the 
potentially mutated target nucleic acid. 
Volatile Salts 

Hie methods of this invention include the use of volatile sails, which is* an 
innovative alternative to NaCl, MgCfc, or other commonly used sails. Volatile 
salts are any salts that completely evaporate, leaving little or no salt residue in die 
sample to be analyzed in the mass spectrometer, fur example, the isolated set of 
NLFs, Volatile salts useful in the methods described herein include ammonium 
bicarbonate, dimethyl ammonium bicarbonate and trimethyi ammonium 
bicarbonate- These volatile salts are useful in many different aspects of the 
methods described herein, including use in hybridizing of nucleic acids, washing 
nucleic acids to remove impurities, and digestion of nucleic acids with 
eodonucleases or other enzymes. Kather than performing washes at reduced levels 
of nonvolatile salts, which might cause the nonrandom length target fragments to 
denature from a solid support bound oligonucleotide probe, h is a preferred 
embodiment to wash support-bound nonrandom length fragments in the presence of 
relatively high levels of NliHCQ,, e.g. 100 mM, and then to evaporate die 
volatile salt prior to analysis by mass spectrometry. Volatile salts are useful for 
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burfcr exchange in all cases where nucleic acids arc to be analyzed by mass 
spectrometry. 

Solid phase purification schemes involving DNA hybridization commonly 
described in rhe literature do not focus on tlie removal of sails since gel 
5 electrophoresis techniques are much more tolerant of salts than mass spectrometry. 

[S. Wang, M, Krinks & M, Moos *DNA Sequencing from Single Phage Plaques 
using Solid-Phase Magnetic Capture" Biotechniques 18, 130 (1995>; R. 
Sandaltzopoulos & P. Becker "Solid-Phase DNase I Fbotpriniing" Boehringcr 
'Mannheim Bkxhcnifca 3, 25 (1993); both incorporated by reference herein] 

10 These methods are primarily focus on the removal of strands complementary to 

template prior to enzymatic reaction and/or enzymes and unincorporated labeled 
nucleotides or primers following reaction. In such schemes residual salt levels can 
be as high as lOOmM NaCl and 25 mM MgCU. Mass spectrometry is intolerant 
of salt concentrations of this level. [T. Shaler et al. "Effect of impurities on the 

15 Matrix- Assisted Laser Desorption Mass Spectra of Single-Stranded 

Oligodeoxynucleotides* Anal. Chem. 68, 576 (1996)] The methods described 
herein using volatile sails provide an innovative approach to isolating and handling 
target nucleic acids and/or nonrandom length fragments for mass spectrometry 
analysis. 

20 The volatile salts can be removed from the 6ample prior to mass 

spectrometry analysis by evaporation. Evaporation of the volatile salts can be 
enhanced using a variety of methods, including use of vacuum, heating, laminar 
flow of a dry gas over the sample, or, in the case of ammonium bicarbonate (or 
dimethyl- or (rimethy lammonium bicarbonate), reduction of the pH by addition of 

25 an acid, including 3-HPA, can speed up the decomposition of the salt into 

ammonia (or dimethyl- or trixncthylammonia) and carbon dioxide. Volatile salts 
can be used in a variety of methods, beyond those described here, for preparing 
samples of any number of organic molecules, including proteins, polypeptides, and 
polynucleotides, for mass spectrometry analysis. 

30 Each of the nonrandom fragmentation techniques described herein can be 

used in combination with any of the isolation methods also described herein. 
Moreover die nonrandom fragmentation techniques can be used in combination 
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with each other, as one of ordinary skill in the art using the techniques described 
herein how to combine the different aspects of tbe invention For example, the 
mulaf iun-speciric cleavage technique can be combined with a set of restriction 
endonudease-cleaved KLFs. All of these methods and combinaiioas thereof can 
optionally include use of mass-modified nucleotides, internal calihranis and volatile 
salts. 

The kits described above for nonrandomly fragmenting target nucleic acids 
and detecting mutations in one or more target nucleic acids can also contain a 
combination of different means of nonrandomly fragmenting the target nucleic 
acids a& well as different means of isolating the nonrandom length fragments that 
are 10 be analyzed by mass spectrometry. 

The following examples are provided to illustrate embodiments of the 
invention, but do not limit the scope of die invention. 

EXAMPLES 

Example 1 , PGR Amplification of Source Nucleic Acids. 

PCR methods have been extensively developed during the last decade. An 
example protocol is as follows. A sample containing 10-10,000 copies of a source 
DNA molecule is mixed with two antiparallel DNA primers thai surround a targeted 
sequence, e.g. the coding region for a gene involved in carcinogenesis. The PCR 
mix is composed of: 8 fi\ 2.5 mM deoxynudeoside triphosphates, 10 pi 10X PCR 
buffer, 10 fi\ 25 mM MgCI„ 3 /d 10pM forward primer, 3 /il 10^M reverse primer. 
0.3 pi thermostable Taq DNA polymerase, 64.7 /d H,O f and 1 pi source DNA. The 
sample tube is scaled and placed into a thermal cycling device. A typical cycling 
protocol is as follows: 



Step 1 


95*C 2min. 


Step 2 


95'C 15 sec. 


Step 3 


55'C 15 sec. 


Step 4 


72*C Imin. 


Step 5 


repeat Steps 2-4 35 times 


Step 6 


72' C 5 min. 


Step 7 


stop 
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Example 2. Production of Single-Stranded Nucleic Acids by Asymmetric PCR. 

The basic PCR procedure can be modified in order to produce predominantly 
one of the two strands. TTiese asymmetric procedures involve modifying the ratios 
of the two primers, a typical ratio is t0: 1 . 

5 

Example 3. Production of Single-Stranded DNA via Biotinylated PCR Products. 

For the preparation of capture probes one of the two primers can be 
synthesized with si biottn moiety internally or at the 5' end of the oligonucleotide. 
Following a standard PCR. the double-stranded product can be bound to a solid-phase 

10 surface coated with streptavidin. For example, 10 pmol of double-stranded PCR 

product is mixed with 5 jd MPG 1 10 mg/mlj paramagnetic streptavidin-coated heads 
in a binding/washing buffer of 2.0 M NaCl, 10 mM TrisCl, 1 mM EDTA, pH 8.0. 
The solution is incubated for 15 min. at room temperature with mixing. Following 
incubation the tube is placed next to a high field, rare earth magnet and the 

IS paramagnetic beads with Che bound biotinylated PCR product are precipitated to the 

wall of the tube. The supernatant is removed, and the particles, outside the influence 
of the magnetic ficld t are resuspended into binding/washing buffer. The beads and 
wash solution an mixed and then subjected once again to die magnetic Held to 
precipitate the magnetic particles. The supernatant is once again removed and either 

20 the wash step is repeated or the alkaline denaturatkm step commences. In order to 

release the unbiotinylated strand from the double-stranded product the beads are 
mixed with an alkaline denaturation solution, 0,1 M NaOH. The beads are incubated 
at room temperature for 10 min. which denatures the PCR product and releases the 
unbiotinylated product into solution. The biotinylated strand, bound to the magnetic 

25 beads is precipitated from the solution under the magnetic field and unbiotinylated 

strand, now single-stranded, is transferred to a new tube with the supernatant. In an 
optional secondary step, the now single-stranded biotinylated strand can be (reed from 
the magnetic beads by boiling the beads in water for 10 min and transferred with the 
new supernatant after magnetic precipitation of the magnetic heads. 



30 
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Example 4* Mass Modiflcati n of Target Nuckfc Acids- 
Mass modification of the target nucleic acid is performed during the 
amplification step. One or more standard deoxynucleosidc triphosphates are replaced 
with modified deoxynucleoside triphosphates. As an example thymidine is replaced 
with a 5~alkynyl-substituted-2 1 -deoxyuridite triphosphate. Because the modified 
nucleotides may not be efficient substrates for DNA polymerase it may he necessary 
to increase the concentration of the corresponding triphosphate by a factor of 2 to 100 
over normal levels. 

* 

Example 5. Nonrandom Fragmentation of Double-Stranded Target Nucleic 
Acid* Using Restriction Endonucleases 

Specifically-sized, double-strand DNA products produced, for example, by 
PCR arc subjected to sequence-specific fragmentation using restriction endonucleases. 
As an example, 10 pmoles of a 500 base pair PCR product is treated with one unit 
each of the frequently cutting enzymes Mnl 1 and KinP I in the buffer recommended 
by the enzyme supplier. The reaction is incubated at 37 'C for 1 hour, followed by 
an enzyme-denaturing incubation at 65 4 C fur 15 min. 

Example 6. Nonrandom Fragmentation of Single-Stranded Target Nucleic 
Acids Using Small Oligonucleotide Restriction Site Probes in 
Combination with Restriction Endonucleases. 

Single-stranded DNA target, produced, for example, by asymmetric PCR or 
by the solid phase methods descrfoed in Example 3, is mixed with small 
oligonucleotide restriction probes complementary to selected restriction site locations. 
As an example, a set of 10 base long probes targeting the Has m recognition 
sequence, are synthesized with the sequence 

(SEQ ID NO: 1) 5* NNNGGCCNNN 3\ where the N's are chosen to allow the 
restriction site probes to fully complement the single-stranded target DNA at the sites 
where the Hae III recognition site (e.g. the probe (SEQ ID NO; 2) 5' 
GACGOCCAAA T to complement the target sequence (SEQ ID NO: 3) 5 f 
...TTTGGCCGTC... 3'). The mixture of target and probes, dissolved in die 
restriction buffer to be used in the cleavage step. Is denatured at 95 C and then 
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incubated at 32 "C (the average T m melting temperature for the probes) for 15 mid. 
allowing the probes to anneal to target and producing a mixture of si nglc-stiandcd and 
double-stranded regions within the target nucleic acid. The hybridized product is then 
cleaved at the double-stranded sites using one or more specific restriction 
5 endonucleases (e.g. Hae III), under conditions similar to those described in Example 

3. 

Example?. Nourandom Fragmentation of Single-Stranded Target Nucleic 
Awls Using Fragmentation Probes in Combination with Single- 
10 Strand-Specific Endonucleasea. 

Single-stranded DNA targci, produced, for example, by asymmetric PCR or 
by the solid phase methods described in Example 3, are mixed with fragmenting 
probes complementary to the target DMA. As an example, a mixture uf probes with 

15 sizes of 24, 26, 2B r 30, 32, and 34 each with sequences complementary to different, 

nonoverkpping regiuns of the single-stranded target DNA, The mixture of target and 
probes, dissolved m SI nuclease digest buffer comprised of 50 mM Na Acetate pH 
4.5, 280 mM NaCl, 50 mM MgCl ; , and 4.5 mM ZnSO^are denatured at 95' C and 
then incubated at 55 X (the average T B for the probes) for 15 min. allowing the 

20 probes to anneal to target and producing a mixture of single-stranded and double- 

stranded regions within the target nucleic acid. The hybridized product is then 
digested in the single-stranded regions using 1 U Si nuclease per jig target DNA, 
incubated at room temperature for 30 min. 

25 Example 8. Nourandom Fragmentation of Single-Stranded Target Nucleic 

Adds Using Mismatch-Specific Cleavage. 

Exflnpfc 9.1, QKrafral CJe*W at Mismatched Cytpyn? 

30 A heterozygous, mutation-containing DNA target is produced, either by PGR 

of a heterozygous source nucleic acid or by hybridization uf wild-type probes to a 
mutation- containing single-stranded target DNA. For solid phase capture and 
purification protocols the DNA probes are synthesized either chemically or 
enzymatically in such a way as to contain biotin moieties. Ry cither route, when a 

35 mutation is present a mismatch forms between Ihc target and wild type. A cleavage 



W0 97/33WM1 FCI7US97/Di49» 

57. 

solution of hydroxyiamine is prepared by dissolving 1.39 g of hydroxyiamine 
hydrochloride in 1 .6 mL of warm H,0 followed by the drop wise addition of 1 .75 mL 
of dtemyiamjoe to yield a solution of pH 6. A 6 mL sample of double-stranded DNA 
containing a mismatch site is mixed with a 20 mL of hydroxyiamine solution and chc 
resulting solution is incubated at 37*C for 30 minutes. The reaction is stopped by the 
addition of 374 mL of H,0 and the solution is removed cither by solid phase capture 
Of the reaction products using magnetic beads with washes performed in a similar 
manner to that described in Example 3 or by multistep ccntrifugation in a Microcon- 
* 30 ultrafiltration unit (Araicon). Hie reaction products are redissolved in 45 mL of 
H,0 and 5 mL of piperidine is added. The solution is incubated at 90C for 30 
minutes and then placed on ice to cool. A 300 mL portion of ILO is added and 
samples are either evaporated to dryness or purified by one of the two methods 
described in Examples 9 and 10. 

A typical mass spectrum obtained from the hydroxyiamine fragmentation at 
a paint mutation is shown in FIG, 14, The source DNA in this case is a section of 
the coding sequence for the p53 gene. A 1 34 base long PGR product is produced as 
in Example 1, amplifying p53 from codem 188 to 233 containing a heterozygous point 
mutation in codon 213, CGA->TGA, The forward primer containing a S'-biotinand 
a chemically labile linker within the primer, the reverse primer being a standard 
oligonucleotide, The mismatch containing PGR. product is treated with hydroxyiamine 
as described above, cleaving the mismatch at C in codon 213, The product is 
purified as described in Example 10, and analyzed as described m Example, II. A 
strong peak appears at the mass correlating to a product 75 bases in site identiiying 
that a C is present in a mismatch in the first position of codon 213. An analysis of 
mutation-free wild type, shown in FIG. 15, contains no mismatch and therefore nu 
cleavage occurs. 

Example 8 2. Chemical Cleava ge at Mismatched Thy mine 
DNA is obtained in a similar maimer to Example 8.1. The modification 
reagent is a 20 mM solution of KMnO, in deiotii2ed H,0. To 6 mL of double- 
stranded DNA containing a mismatch site is added 14 mL of the modification 
reagent. The solution is mixed gently at room temperature over the course of two 
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minutes during which time the solution turns slightly brown. A 20 ml- portion of a 
solution consisting of t.25 M sodium acetate pH 8.5 and containing 1 M 2- 
mcrcaptoethanol is added to stop the reaction, which result in the solution becoming 
immediately colorless. A 360 mL portion of 110 is added and the solution is either 
5 spun through a Microcon-30 ultrafiltration unii 2X, collected, and then evaporated to 

dryness or taken through a solid phase capture and wash protocol. The DNA is 
redissohred in 45 mL of H,0 and 5rnL of piperidine is added. The resulting soSution 
is heated to 90C for 30 minutes and then placed on ice to cool. After it cools, the 
solution is diluted by the addition of 300 mL of H,0 and then evaporated to dryness. 

10 As an alternative the cleavage products can be purified by one of the two methods 

described in Examples 9 and 10. 

A typical mass spectrum obtained from the KMnO* fragmentation at a point 
mutation is shown in FIG. 16. Ttic source DNA in this case is a section of the 
coding sequence for the p53 gene. A 134 base long PCR product is produced as in 

15 Example 1, amplifying p53 from codon 188 to 233 containing a heterozygous point 

mutation in codon 213 > CCA- > TCA. The forward primer containing a 5'-biotiu and 
a chemically labile linker within the primer, the reverse primer being a standard 
oligonucleotide. The mismatch containing PCR product is treated with KMn0 4 as 
described above, cleaving the mismatch at C in codon 213. The product is purified 

20 as described in Example 10 9 and analyzed as described in Example 11. A strong 
peak appears at the mass correlating to a product 75 bases in size identifying that a 
T is present in a mismatch in the first position of codon 213. Based on the data from 
the analysis in FIG. 14 and FIO. 16 tt is possible to confirm that a C->T mutation 
has occurred in this p53 sample, 

25 

Example 9. Purification of Nonrandom Length Fragments Using Capture Probes 

Nunranduw fragments are purified by annealing to a capture probes. The 
capture probe or probes consists of a sequence or sequences complementary to die 
selected target nonrandom length fragments. One method uses the a full length 
30 capture probe prepared as described in Example 3 t another uses a number of 

chemically synthesized capture probes prepared with biocn covalently attached. For 
cither method the procedure is identical. A 10 ixL sample containing a single fall- 
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length biotmylated capture probe or a mixture of smaller, synthetic, biotinylatcd 
capture probes is mixed with 10 of nonrandom fragments in an annealing buffer 
consisting of 3U0mM NaCI. lOmM Tris, and lmM CDTA pH 7.5. The mixture is 
heated in a boillng-H 2 0 bath for 10 min, and then quickly placed in an ice-ILO bath. 
The mixture is then transferred to a pre-heated thermal block ai 42 'C (the 
temperature is adjusted depending on the T c of the capture probe or probes) and 
incubated for 1 hour. The solution is then allowed to cool and then mixed with 
strepia vidin-coatcd magnetic beads. Binding to the beads takes place according to the 
" procedure described in Example 3- After the binding step, in place of the alkaline 
denaturation step, ihc bound, hybridized nonrandom fragments are washed with a 
volatile buffer such as I M NHJ1GO,. After 6 cycles of resuspension in 1 M 
NH3CO,, magnetic precipitation, and removal of the supernatant, the beads are 
resuspended in 10 of de ionized H ; 0 and heated to 65 X for 5 min, in order to 
release the nonrandom fragments from the bound biotinylated strand. The beads arc 
quickly precipitated from the warm solution and the supernatant containing the 
nonrandom fragments is transferred to another tube, The solution of nonrandom 
fragments is dried to remove excess volatile buffer and then analyzed by mass 
spectrometry as described in Example 11. 

An example of capture and analysis of nonrandom length fragments is shown 
in FIG. 17. The source DNA in this case is a section of the coding sequence for the 
p53 gene. A 184 base long PCR product is produced as in Example U amplifying 
p53 from codon 232 to 292 contain^ a heterozygous point mutation in codon 248, 
CGG->CAG, The double-stranded PCR product is digested using the restriction 
enzyme Mai 1 under conditions described in Example S. A frill length capture probe 
of the negative strand is produced as in Example 3. and the nonrandom length 
fragments derived from the positive strand are captured and purified as described 
above. The purified single-stranded fragments are analyzed as described in Example 
11. Shown in FIG. 16 are the 5 single-branded positive fragments produced from 
an Mnl I digest of the wild type 184 base long PCR product. By performing single- 
stranded isolation the five similarly sized negative strand fragments are eliminated 
from the spectra and all of the fragments are fully resolved. 
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Shown in FIG. 18 is a magnification of the spectra examining the 26 base long 
fragment that, in the heterozygous mutation case, contains the G->A mismatch. 
Shown are two clearty resolved peaks with a mass difference of 16 Da, exactly the 
difference between G and A and thus confirming the presence of a mutation. The 
5 ihird smaller peak correlates to a salt adduct of the high mass 26 base product and 

emphasizes the need for a process that stringently removes salt prior to analysis. 

Example 10. Alternative Purification Method for Mismatch-Specific Nonrandom 
Length Fragments. 

10 

The purification of nonrandom fragments that were produced by a mutatioo- 
specific cleavage, e.g. chemical cleavage at mismatch sites, can be achieved in an 
alternative way. In this case the fragmentation is performed un a PGR product that 
has one solid- phase capturaWe strand, e.g. containing biottn, and that is also able to 

15 be cleaved from the solid support. e,g. a bridging phosphorothioate linkage contained 

in the primer region [Mag et ai. f Nucleic Acids Res. 19(7): 1437-1441 (1991)1. As 
an example of this method, a PGR reaction is performed as described in Example 1, 
but with one of the primers containing a 5 f -end biodn modification and also a 
bridging phosphorothioate linkage located 5-5 bases from the 3^end, and the other 

20 primer a normal one. After amplification the PGR product is subjected to a mutation- 

specific fragmentation method directly since, for heterozygous mutations, mismatch- 
containing heteroduplexes are formed in dm during the PCR. In order to check for 
die possibility of a homozygous mutation, the sample is mixed with an equal amount 
of wild type control, annealed and then subjected to the fragmentation reaction. The 

25 material recovered from the fragmentation reaction* is purified and made single- 

stranded by the method described in Example 3. In this case, after the denaturing 
step, the products are released from the magnetic heads after several H 2 0 washes by 
treatment with 5 pL of 0.02 mM AgNO, and incubating at 45 *C for IS min. The 
Ag+ ions are sequestered by the addition of I of 100 mM DTT. The samples 

30 are dried to remove excess DTT and then analyzed by mass spectrometry by the 

method described in Example 11. 
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Example 11. Mass Spectrometry Analysis. 

The imcleic acid sample to be analyzed is typically mixed with an equal 
volume of matrix solution consisting of 0.5 M 3-hydroxypicolinic acid (3-HPA) and 
50 mM diammoniurn hydrogen citrate. Typically a 1 jiL portion of the sample is 
applied co the mass spectrometer sample stage and allowed to dry under a gentle 
stream of nitrogen gas at room temperature. When the sample has completely dried 
to form crystals (typically 5 min.) the sample is inserted into the mass spectrometer 
for analysis. The usual analysis conditions employ the use of a Nd: YAG laser 
operating at 266 nm with an average pulse energy of 50 inJ/em : . An average of 100 
laser shots is typically used to obtain a spectrum. 

All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each individual publication 
or patent application was specifically and individually indicated to be incorporated by 
reference. 

The invention now being fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the invention and the appended claims. 
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SEQUENCE LISTING 

fl) GENERAL INFORMATION : 

(i> APPLICANT; MOKFORTSr JOSEPH A. 

SHALER, THOMAS A. 
5 TAK, YUPING 

B3CKER, CHRISTOPHER H. 

(ii) TITLE 0? INVENTION : METHODS OF SCREENING 
NUCLEIC ACIDS USING MASS 

10 SPECTROMETRY 

liii) NUMBER OF SEQUENCES; 3 

(iv) CORjUSSPONDENCE ADDRESS: 

15 (A) ADDRESSEE: CCOLEY GODWARD LLP 

(B) STR3ET: FIVE PALO ALOTO SQUARE 

301)0 EL CAMINO REAL 

(C) CITY: PALO ALTO 

( D ) STATE ; CALIFORNIA 
20 (E) COUNTRY: USA 

(F) ZIP: 94306 

(v) COMPUTER READABLE FORM: 

{A) MEDIUM TYPE: floppy disk 

25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS -DOS 

<D) SOFTWARE: Patentln Release #1-0, Version 

#1,25 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: KARCH 4 f 1997 

(C) CLASSIFICATION: 

35 {viii PRIOR APPLICATON DATA 

(A) APPLICATION NUMBER: 60/012,752 
(3) FILING DATE: MARCH 4, 1996 

(viii) ATTORNEY /AGENT INFORMATION: 
40 (Al NAMUi JACKIE N. NAXAMURA 

(3} REGISTRATION NUMBER: 35,966 

(C) REFERENCE/DOCKET NUMBER: GNTR- 001/01WO 

(ix) TELECOMMUNICATION INFORMATION: 
45 (AJ TELEPHONE: 415-843-S214 
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(2) INFORMATION FOR SEQ IB NO: 1- 

(i) SEQUENCE CHARACTERISTICS: 
{A; LENGTH? 10 nucieoLides 
«B' TYPE: nucleic acid 

iCj STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYFfl : DNA, RNA 
(iiij HYPOTHETICAL: YES 

(iv) ANT J -SENSE: NC 

(v) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION; 

fvij SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
NKNGGCCNNN 
<3) INFORMATION FOR SEQ IV. NO: 2: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

vii} MOLECULE TYPE: DNA, RNA 
(ill) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: NO 

(v) FEATURE: 

(A) NAME/KEY: 
<B> WCATION: 

<vi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GACRGCCAAA 



{4} INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 nucleotides 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) -rOPOLOCY: linear 



(ii) MOLECULE TYPE: DNA, RNA 
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Uii) KYPOTHETIW.: YRS 
Civ) ANT I -SENSE: NO 

(v) FEATURE: 

(A) NAME/KEY : 
(Bi LOCATION: 

(vi) SEQUENCE DESCRIPTION: SEC" ID NO: 2: 
TTTGGCCGTC 
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WE CLAIM; 

1. A method of detecting mutations in a target nucleic acid comprising; 
obtaining fix>m said target nucleic acid a set of nonrandom It- ngth fragments 

(NLFs) in single-stranded form, wherein said set comprises NLFs 
derived from one of either the positive or the negative strand of said 
target nucleic acid or said set is a subset of .single -stranded NLFs 
derived from bath the positive and the negative strand of said target 
nucleic acid, 

determining masses of the members of said set using mass spectrometry. 

2. The method of claim 1 wherein at least one member of said set of single- 
stranded NLFs optionally has one or more nucleotides replaced with mass-modified 
nucleotides 

3. The method of claim 2 wherein said determining step optionally further 
comprises 

utilizing internal seff-calibmits to provide improved mass accuracy. 

4. The method of claim 3 wherein said target nucleic acid is single-stranded and 
said obtaining step further comprises: 

hybridizing said single-stranded target nucleic acid to one or more sets of 
fragmenting probes to foira hybrid target nucleic acid/fragmenting 
probe complexes comprising at least one double-stranded region and 
a! least one single-stranded region, 

Donrandomly fragmenting said target nucleic acid by cleaving said hybrid 
target nucleic acid/fragmenting probe complexes at every single- 
stranded region with at least one single-strand-specific cleaving reagent 
to form a set of NLFs. 

5. The method of claim 4 wherein said set of fragmenting probes leaves single- 
stranded gaps between double-stranded regions formed by hybridization of said set 
of fragmenting probes to said target nucleic acid. 
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6. The method of tidbit 5 wherein said hybridizing seep further comprises: 
providing two sets of single-stranded target nucleic acid and 

separately hybridizing a first set of fragmenting probes to a first set of single- 
stranded target nucieic acid and a second set of fragmenting probes co 
5 a second set of single -stranded target nucleic acid, wherein said 

members of said second sel of fragmenting probes comprise at lcasi 
one single-stranded nucleotide sequence complementary to regions of 
said target nucleic acid that arc not complementary to any nucleotide 
sequences in any members of said first set of fragmenting probes. 

10 

7. The method of claim 6 wherein said members of said first set of fragmenting 
probes comprise nucleotide sequences that overlap with nucleotide sequences of said 
members of said second set of fragmenting probes. 

15 8. The method of claim 4 wherein said single-strand-specific cleaving reagent is 

a single-strand-specific endn nuclease. 

9. The method of claim 4 wherein said single-strand-spccific deaving reagents 
are single-strand specific chemical cleaving reagents. 

20 

10. The method of claim 9 wherein said single-strand specific chemical cleaving 
reagents are selected from the group consisting of hydroxy lamine, hydrogen peroxide, 
osmium tetroxide, and potassium permanganate. 

25 11. The method of claim 4 further comprising after said nonrandomly fragmenting 

step; 

hybridizing one or more of said NLFs to one or more capture probes, wherein 
said capture probes comprise a singlc-slninded region complementary 
to at feast one of said NLFs and a first binding moiety, 
30 binding said first binding moiety to a second binding moiety attached to a 

solid support, wherein said binding occurs either before or after said 
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hybridizing of said NLFs to one or more capture probes, isolating a set 
of single-stranded NLFs. 



12. The method of claim 4 wherein said fragmenting probes comprise a single- 
slranded nucleotide sequence and a first binding moiety, further comprising; 

after said nonrandomly fragmenting step, binding said first binding moiety to 

a second binding moiety attached to a solid support, and 
isolating said set of singlc*stranded NLFs. 

13. The method of claim 3 wherein said obtaining step further comprises: 
nonrandomly fragmenting said target nucleic acid with one or more restriction 

eadonucleases to form a set of NLFs, hybridizing one or more uf said 
set of NLFs or a subset thereof to one or more oligonucleotide probes, 
wherein each of said oligonucleotide probes comprises a nucleic acid 
comprising a single-stranded region and a first binding moiety, binding 
said first binding moiety iu a second binding moiety attached to a solid 
support either before or alter aid hybridizing step, and isolating said 
set or subset of single-stranded NLFs. 

14. The method of claim 13 wherein all of said oligonucleotide probes consist of 
one of either full-length positive or full-length negative single strands of said target 
nucleic acid and a first binding moiety. 

15. The method of claim 13 wherein said binding between said first binding 
moiety and said second binding moiety is a covalent attachment. 

16. The method of claim 13 wherein one binding moiety is a member selected 
from the group consisting of an antibody, a hormone, an inhibitor, a co-factor 
portion, a binding ligand, and a polynucleotide sequence, and the other binding 
moiety is a corresponding member selected from the group consisting of an antigen 
capable of recognizing said antibody, a receptor capable of recognizing said hormone, 
an enzyme capable of recognizing said inhibitor, a cofactor enzyme binding site 
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capable of recognizing said eu-iactur portion, a substrate capable of recognizing said 
binding Jigand. and a complementary polynucleotide sequence. 

17. The method of claim 13 wherein said isolating further comprises: 
5 washing said sex of NLFs bound to said solid support with a solution 

comprising volatile salts selected from the group consisting of 
ammonium bicarbonate dimethyl ammonium bicarbonate and trimcthyl 
ammonium bicarbonate. 

10 18. The method of claim 3 wherein said target nucleic add is single-stranded and 

wherein said obtaining step further comprises: 

hybridizing said single-stranded target nucleic acid to one or more restriction 
site probes to form hybridized target nucleic acids having double- 
stranded regions where said restriction site probes have hybridized to 
15 said single-stranded target nucleic acid and at least one single-stranded 

region* nonrandomly fragmenting said hybridized target nucleic acids 
using one or more restriction endonucleases that cleave at restriction 
sites within said double-stranded regions. 

20 19. The method of claim 18 further comprising after said nonrandomly 

fragmenting step, 

hybridizing said NLFs to one or more capture probes, wherein said capture 
probes comprise a single-stranded region complementary to at least one 
of said NLFs and a first binding moiety, binding said first binding 
25 moiety to a second binding moiety attached to a solid support, wherein 

said binding occurs either before or after said hybridizing of said 
NLFs to one or more capture probes r isolating a set of single-stranded 
NLFs. 

30 20. The method of claim 19 wherein said cleaved restriction site probes comprise 

a single^ stranded region complementary to half of a restriction endonuclease site and 
a first binding moiety , and further comprising alter said nonrandomly fragmenting 
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step, binding said firsc binding moiety to a second binding moiety attached 10 a solid 
support, and isolating a sec of single-stranded NIJ*. 



21 . The method of claim 3 wherein said target nucleic acid is single-stranded and 
said obtaining step further comprises: 

providing conditions permitting folding of said single-stranded target nucleic 
acid to form a three-dimensional structure having intramolecular 
secondary and tertiary interactions, 
nonrandomly fragmenting said folded target nucleic acid with at least one 
structure-specific cndonuclease to form a set of single-stranded NLFs, 

modifying either said target nucleic acid or said set or single-stranded NLFs 
such that members of said set of single-stranded NLFs comprise a single-stranded 
nucleotide sequence and at least one first binding moiety, 

binding said firsc binding moiety to a second binding moiety attached to a solid 
support, and 

isolating said set of single-stranded NLFs. 

22, Hie method of claim 3 wherein said target nucleic acid is single-stranded and 
said obtaining step further comprises: 

providing conditions permitting folding of said single-stranded target nucleic 

acid to form a three-dimensional structure having intramolecular 

secondary and tertiary interactions, 
nonrandomly fragmenting said folded target nucleic acid with at least one 
structure-specific endonuclease to form a set of single-stranded NJ-Fss, 

hybridizing one or more of said set of NLFs to one or more capture probes. 

wherein said capture probes comprise a single stranded nucleotide 

sequence and a first binding moiety, 
binding said first binding moiety to a second binding moiety attached to a solid 

support either before or after said hybridizing step, and 
isolating a set of single-stranded NLFs. 
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23. The method of claim 21 wherein said isolated set of single- stranded NLRs 
comprise any NLFs having a 5* end of said target nucleic acid. 

24, The method of claim 22 wherein said isolated set of single-scraiided NLFs 
5 comprise any NLFs having a 5 1 end of said target nucleic acid. 



25. The method of claim 21 wherein said structure-specific endonuclsase is 
selected from the group consisting of: 

T4 endomiclease VII , RuvC MulY, and the endomicleolytic activity from the 
10 5 % ~V exonudease subunh of therrao-stable polymerases, 

26. Hie method of claim 3 wherein said target nucleic acid is single-stranded and 
wherein said obtaining step further comprises. 

hybridizing said single stranded target nucleic acid to one or more wild type 
15 probes* 

nonrandomly fragmenting said target nucleic acid with one or more mutation- 
specific cleaving reagents that specifically cleave at any regions of 
nucleotide mismatch that fonu between said target nucleic acid and any 
of said wild type probes. 

20 

27. The method of claim 26 wherein said nomandomty fragmenting step further 
comprises: 

digesting said first set of nonrandom length fragments with one or more 
restriction endonucleases or 
25 cleaving said first set of ooorandom length fragments with one or more single- 

strand-specific cleaving reagents. 
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28. The method of claim 26 wherein members of said set of single -stranded NI J* 
comprise a single-stranded region and ai least one first bidding moiety, farther 
comprising after said nonrandomly fragmenting step, binding said first binding moiety 
to a second binding moiety attached to a solid support, and isolating a set of single- 
stranded NLFs. 

29. The method of claim 26 wherein said obtaining step farther comprise*: 
hybridizing members of said set of NLFs to one or more capture probes, 

wherein said capture probes comprise a single-stranded nucleotide 
sequence and at least one first binding moiety, binding said first 
binding moiety to a second binding moiety attached to a solid support, 
and isolating a set of single-stranded NLFs. 

30. The method of claim 26 wherein said obtaining stop further comprises: 
isolating a set of single-stranded NLFs comprise any NLFs having a 5 1 end 

of said target nucleic acid, 

31. A method of detecting mutations in a target nucleic acid comprising: 
nonrandomly fragmenting said target nucleic acid with one or more restriction 

endnmicleases to form a set of double-stranded NLFs. wherein said 
nonrandomly fragmenting further comprises using volatile salts in a 
restriction buffer, determining masses of the members of the set of 
double-stranded NLFs, wherein said determining does not involve 
sequencing of said target nucleic acid. 

32. A method of detecting mutations in a double-stranded target nucleic acid 
comprising: 

nonrandomly fragmenting said target nucleic acid using one or more 
restriction endonucleases to form a first set of nonnindom length 
fragments (NLFs), 

hybridizing members of said first set of NLFs to a set of wild type 

probes, 
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nonrandomly fragmenting one or more muubers of said set of NLFs 
with one or more mutation-specific cleaving reagents that 

specifically cleave ai any regions of nucleotide 
mismatch that fonn between members of said first set 
5 of NLFs and complementary members of said set of 

wild type probes, wherein said nonrandomly 
fragmenting step forms a second set of NLFs, and 
determining masses of members of said second set of NLFs using mass 
spectrometry, wherein said determining does not require sequencing of 
10 said target nucleic acid . 

33. He method of claim 32 further comprising 

obtaining said set of wild type probes by nonrandomly fragmenting a wild 
type target nucleic acid using die same restriction endonucieases used 
15 to form said first set of NLFs. 

34. The method of claim 33 wherein said steps of nonrandomly fragmenting of 
said target nucleic acid and obtaining said set of wild type fragmenting probes are 
performed simultaneously in a single solution. 

20 

35. The method of claim 32 further comprising before said determining step* 
isolating said second set of NLFs wherein said members of said second set 

comprise double-stranded nucleotide sequences and a first binding 
moiety, and binding said first binding moiety to a second binding 
25 moiety attached to a solid support. 

36. The method of claim 32 further comprising before said determining step, 
isolating said second set of NLFs wherein said isolating comprises hybridizing 

members of said second set of NLFs to one or more capture probes, 
30 wherein said capture probes comprise a single-stranded nucleotide 

sequence and a first binding moiety, binding said first binding moiety 
to a second binding moiety attached to a solid support. 



WO 97/33000 PC7mJK97M3499 

73. 

37. A method of detecting mutations in a target nucleic acid comprising: 

nonrandomly fragmenting said target nucleic acid, using a solution comprising 
one or more volatile salts to form a set of nonrandom length fragments 
(NLFs), 

determining masses of members of said set of NLFs using mass spectrometry, 
wherein said determining does not involve sequencing of said target 
nucleic acid. 

3S. A method of decreasing background noise comprising 

obtaining a sample to he analyzed by a mass spectrometer, 
washing said sample with a solution of volatile salts, and 
evaporating the solution of volatile salts from the sample. 

39. A method of obtaining nonrandom length fragments from a target nucleic acid 
comprising: 

hybridizing one or more sets of fragmenting probes to said target nucleic acid 

to form a set of hybrids, 
cleaving single-stranded regions of members of said set of hybrids. 

40, A kit for detecting mutations in one or more target nucleic acids in a sample 
comprising: 

(a) one or more sets of fragmenting probes, wherein said fragmenting 
probes are complementary to a sequence of one or more of said target 
nucleic acids; 

(b) a single-strand specific cleaving reagent; and 

(c) a solid support capable of isolating said single-stranded target nucleic 
acids that have been nonrandomly fragmented into single-stranded 
nonrandom length fragments. 



41. The kit of claim 40, wherein said single-strand specific cleaving reagent is a 
single- ^rand-specific chemical cleaving reagent selected from the group consisting of 
hydroxylamine, hydrogen peroxide, osmium tetroxide, and potassium permanganate. 
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42. The kit of claim 40, wherein said single-scrand specific cleaving reagent is a 
nuclease selected from the group consisting of Mung bean nuclease, Nuclease Si , and 
RNase A. 

43. A kit for detecting mutations in one or more target nucleic acids in a sample 
comprising: 

(a) one or more sets of restriction site probes, wherein said probes 
comprise a single -stranded sequence capable of hybridizing to a 
sequence of said one or more target nucleic acids; 

(b) one or more restriction endonucleases that cleave at restriction sites 
within said restriction site probes; and 

(c) a solid support capable of isolating said single-stranded target nucleic 
acids that have been nonraudomly fragmented into single-stranded 
nonrandom length fragments. 

44. The kit of claim 45, wherein said restriction endonuclease is a Class US 
restriction endonuclease. 

45. The kit of claim 43 , wherein said restriction site probe comprises two regions, 
a first region thai is single-stranded and complementary to a specific sequence within 
said target nucleic acid, and a second region that is double-stranded and contains a 
restriction recognition site for a Class IIS restriction endonuclease. 
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